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We live in a world full of scientific and technological advances. In recent years it has become quite difficult 
not to notice the proliferation of something called computer graphics. Almost eveiy computer system is set up to 
allow tlie user to interact with the system through a graphical user interface, where information on the display 
screen is conveyed in both textual and graphical forms. Movies and video games are popular showcases of the 
latest technology for people, both young and old. Watching the TV for a while, the likelihood is that you will 
see the magic touch of computer graphics in a commercial. 

This book is both a self-contained text and a valuable study aid on the fundamental principles of computer 
graphics. It takes a goal-oriented approach to discuss the important concepts, the underlying mathematics, and 
the algorithmic aspects of the computerized image synthesis process. It contains hundreds of solved problems 
that help reinforce one’s understanding of the field and exemplify effective problem-solving techniques. 

Although the primary audience are college students taking a computer graphics course in a computer 
science or computer engineering program, any educated person with a desire to look into the inner workings of 
computer graphics should be able to learn from this concise introduction. The recommended prerequisites are 
some working knowledge of a computer system, the equivalent of one or two semesters of programming, a basic 
understanding of data structures and algorithms, and a basic knowledge of linear algebra and analytical 
geometry. 

The field of computer graphics is characterized by rapid changes in how the technology is used in everyday 
applications and by constant evolution of graphics systems. The life span of graphics hardware seems to be 
getting shorter and shorter. An industry standard for computer graphics often becomes obsolete before it is 
finalized. A programming language that is a popular vehicle for graphics applications when a student begins his 
or her college study is likely to be on its way out by the time he or she graduates. 

In this book we try to cover the key ingredients of computer graphics that tend to have a lasting value (only 
in relative terms, of course). Instead of compiling highly equipment-specific or computing environment-specific 
information, we strive to provide a good explanation of the fundamental concepts and the relationship between 
them. We discuss subject matters hi the overall framework of computer graphics and emphasize mathematical 
and/or algorithmic solutions. Algorithms are presented ui pseudo-code rather than a particular programming 
language. Examples are given with specifics to the extent that they can be easily made into working versions on 
a particular computer system. 

W'c believe that this approach brings unique benefit to a diverse group of readers. First, the book can be read 
by itself as a general introduction to computer graphics tor people who want technical substance but not the 
burden of implementational overhead. Second, it can be used by instructors and students as a resource book to 
supplement any comprehensive primary text. Third, it may serve as a stepping-stone for practitioners who want 
something that is more understandable than their graphics system’s programmer’s manuals. 

The first edition of this book has served its audience well for over a decade. I would like to salute and thank 
my coauthors for then invaluable groundwork. The current version represents a significant revision to the 
original, with several chapters replaced to cover new topics, and the remaining material updated throughout the 
rest of the book. I hope that it can serve our future audience as well for years to come. 

Thank you for choosing our book. May you find it stimulating and rewarding. 


Zhigang Xiang 
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Introduction 


Computer graphics is generally regarded as a branch of computer science that deals with the theory and 
technology for computerized image synthesis. A computer-generated image can depict a scene as simple as 
the outline of a triangle on a uniform background and as complex as a magnificent dinosaur in a tropical 
forest. But how do these things become part of the picture? What makes drawing on a computer different 
from sketching with a pen or photographing with a camera? In this chapter we will introduce some 
important concepts and outline the relationship among these concepts. The goal of such a mini-survey of 
the field of computer graphics is to enable us to appreciate the various answers to these questions that we 
will detail in the rest of the book not only in their own right but also in the context of the overall 
framework. 


1.1 A MINI-SURVEY 

First let’s consider drawing the outline of a triangle (see Fig. 1-1 ). In real life this would begin with a 
decision in our mind regarding such geometric characteristics as the type and size of the triangle, followed 
by our action to move a pen across a piece of paper. In computer graphics terminology, what we have 
envisioned is called the object definition, which defines the triangle in an abstract space of our choosing. 
This space is continuous and is called the object space. Our action to draw maps the imaginary object into 
a triangle on paper, which constitutes a continuous display surface in another space called the image space. 
This mapping action is further influenced by our choice regarding such factors as the location and 
orientation of the triangle. In other words, we may place the triangle in the middle of the paper, or we may 
draw it near the upper left comer. We may have the sharp comer of the triangle pointing to the right, or we 
may have it pointing to the left. 

A comparable process takes place when a computer is used to produce the picture. The major 
computational steps involved in the process give rise to several important areas of computer graphics. The 
area that attends to the need to define objects, such as the triangle, in an efficient and effective manner is 
called geometric representation. In our example we can place a two-dimensional Cartesian coordinate 
system into the object space. The triangle can then be represented by the x and y coordinates of its three 
vertices, with the understanding that the computer system will connect the first and second vertices with a 
line segment, the second and third vertices with another line segment, and the third and first with yet 
another line segment. 

The next area of computer graphics that deals with the placement of the triangle is called 
transformation. Flere we use matrices to realize the mapping of the triangle to its final destination in the 
image space. We can set up the transformation matrix to control the location and orientation of the 
displayed triangle. We can even enlarge or reduce its size. Furthermore, by using multiple settings for the 


1 


2 


INTRODUCTION 


[CHAP. I 



Fig. 1-1 Drawing a triangle. 


transformation matrix, we can instruct the computer to display several triangles of varying size and 
orientation at different locations, all from the same model in the object space. 

At this point most readers may have already been wondering about the crucial difference between the 
triangle drawn on paper and the triangle displayed on the computer monitor (an exaggerated version of 
what you would see on a real monitor). The former has its vertices connected by smooth edges, whereas 
the latter is not exactly a line-drawing. The fundamental reason here is that the image space in computer 
graphics is, generally speaking, not continuous. It consists of a set of discrete pixels, i.e., picture elements, 
that are arranged in a row-and-column fashion. Hence a horizontal or vertical line segment becomes a 
group of adjacent pixels in a row or column, respectively, and a slanted line segment becomes something 
that resembles a staircase. The area of computer graphics that is responsible for converting a continuous 
figure, such as a line segment, into its discrete approximation is called scan conversion. 

The distortion introduced by the conversion from continuous space to discrete space is referred to as 
the aliasing effect of the conversion. While reducing the size of individual pixels should make the 
distortion less noticeable, we do so at a significant cost in tenns of computational resources. For instance, if 
we cut each pixel by half in both the horizontal and the vertical direction we would need four times the 
number of pixels in order to keep the physical dimension of the picture constant. This would translate into, 
among other things, four times the memory requirement for storing the image. Exploring other ways to 
alleviate the negative impact of the aliasing effect is the focus of another area of computer graphics called 
anti-aliasing. 

Putting together what we have so far leads to a simplified graphics pipeline (see Fig. 1-2), which 
exemplifies the architecture of a typical graphics system. At the start of the pipeline, we have primitive 
objects represented in some application-dependent data structures. For example, the coordinates of the 
vertices of a triangle, viz., (xr, , xy ), (x 2 ,y 2 ), and (x 3 ,y 3 ), can be easily stored in a 3 x 2 array. The graphics 
system first performs transformation on the original data according to user-specified parameters, and then 
carries out scan conversion with or without anti-aliasing to put the picture on the screen. The coordinate 
system in the middle box in Fig. 1-2 serves as an intermediary between the object coordinate system on the 



Fig. 1-2 A simple graphics pipeline. 
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left and the image or device coordinate system on the right It is called the world coordinate system, 
representing where we place transformed objects to compose the picture we want to draw. The example in 
the box shows two triangles: the one on the right is a scaled copy of the original that is moved up and to the 
right, the one on the left is another scaled copy of the original that is rotated 90° counterclockwise around 
the origin of the coordinate system and then moved up and to the right in the same way. 

In a typical implementation of the graphics pipeline we would write our application program in a host 
programming language and call library subroutines to perform graphics operations. Some subroutines are 
used to prescribe, among other things, transformation parameters. Others are used to draw, i.e., to feed 
original data into the pipeline so current system settings are automatically applied to shape the end product 
coming out of the pipeline, which is the picture on the screen. 

Having looked at the key ingredients of what is called two-dimensional graphics, we now turn our 
attention to three-dimensional graphics. With the addition of a third dimension one should notice the 
profound distinction between an object and its picture. Figure 1-3 shows several possible ways to draw a 
cubic object, but none of the drawings even come close to being the object itself. The drawings simply 
represent projections of the three-dimensional object onto a two-dimensional display surface. This means 
that besides three-dimensional representation and transformation, we have an additional area of computer 
graphics that covers projection methods. 



Fig. 1-3 Several ways to depict a cube. 


Did you notice that each drawing in Fig. 1-3 shows only three sides of the cubic object? Being a solid 
three-dimensional object the cube has six plane surfaces. However, we depict it as if we were looking at it 
in real life. We only draw the surfaces that are visible to us. Surfaces that are obscured from our eyesight 
are not shown. The area of computer graphics that deals with this computational task is called hidden 
surface removal. Adding projection and hidden surface removal to our simple graphics pipeline, right after 
transformation but before scan conversion, results in a prototype for three-dimensional graphics. 

Now let’s follow up on the idea that we want to produce a picture of an object in real-life fashion. This 
presents a great challenge for computer graphics, since there is an extremely effective way to produce such 
a picture: photography. In order to generate a picture that is photo-realistic, i.e., that looks as good as a 
photograph, we need to explore how a camera and nature work together to produce a snapshot. 

When a camera is used to photograph a real-life object illuminated by a light source, light energy 
coining out of the light source gets reflected from the object surface through the camera lens onto the 
negative, forming an image of the object. Generally, the part of the object that is closer to the light source 
should appear brighter in the picture than the part that is further away, and the part of the object that is 
facing away from the light source should appear relatively dark. Figure 1 -4 shows a computer-generated 



Fig. 1-4 Two shaded spheres. 
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image that depicts two spherical objects illuminated by a light source that is located somewhere between 
the spheres and the “camera” at about the ten to eleven o’clock position. Although both spheres have 
gradual shadings, the bright spot on the large sphere looks like a reflection of the light source and hence 
suggests a difference in their reflectance property (the large sphere being shinier than the small one). The 
mathematical formulae that mimic this type of optical phenomenon are referred to as local illumination 
models, for the energy coming directly from the light source to a particular object surface is not a full 
account of the energy arriving at that surface. Light energy is also reflected from one object surface to 
another, and it can go through a transparent or translucent object and continue on to other places. 
Computational methods that strive to provide a more accurate account of light transport than local 
illumination models are referred to as global illumination models. 

Now take a closer look at Fig. 1 -4. The two objects seem to have super-smooth surfaces. What are they 
made of? How can they be so perfect? Do you see many physical objects around you that exhibit such 
surface characteristics? F urthermore, it looks like the small sphere is positioned between the light source 
and the large sphere. Shouldn’t we see its shadow on the large sphere? In computer graphics the surface 
shading variations that distinguish a wood surface from a marble surface or other types of surface are 
referred to as surface textures. There are various techniques to add surface textures to objects to make them 
look more realistic. On the other hand, the computational task to include shadows in a picture is called 
shadow generation. 

Before moving on to prepare for a closer look at each of the subject areas we have introduced in this 
mini-survey, we want to briefly discuss a couple of allied fields of computer science that also deal with 
graphical information. 


Image Processing 

The key element that distinguishes image processing (or digital image processing) from computer 
graphics is that image processing generally begins with images in the image space and performs pixel- 
based operations on them to produce new images that exhibit certain desired features. For example, we 
may reset each pixel in the image displayed on the monitor screen in Fig. 1-1 to its complementary color 
(e.g., black to white and white to black), turning a dark triangle on a white background to a white triangle 
on a dark background, or vice versa. While each of these two fields has its own focus and strength, they 
also overlap and complement each other. In fact, stunning visual effects are often achieved by using a 
combination of computer graphics and image processing techniques. 


Computer-Human Interaction 

While the main focus of computer graphics is the production of images, the field of computer human 
interaction promotes effective communication between man and machine. The two fields join forces when 
it comes to such areas as graphical user interfaces. There are many kinds of physical devices that can be 
attached to a computer for the purpose of interaction, starting with the keyboard and the mouse. Each 
physical device can often be programmed to deliver the function of various logical devices (e.g., Locator, 
Choice — see below'). For example, a mouse can be used to specify locations in the image space (acting as a 
Locator device). In this case a cursor is often displayed as visual feedback to allow' the user see the 
locations being specified. A mouse can also be used to select an item in a pull-dowm or pop-up manual 
(acting as a Choice device). In this case it is the identification of the selected manual item that counts and 
the item is often highlighted as a whole (the absolute location of the cursor is essentially irrelevant). From 
these we can see that a physical device may be used in different ways and information can be conveyed to 
the user in different graphical forms. The key challenge is to design interactive protocols that make 
effective use of devices and graphics in a way that is user-friendly — easy, intuitive, efficient, etc. 
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1.2 WHAT’S AHEAD 

We hope that our brief flight over the landscape of the graphics kingdom has given you a good 
impression of some of the important landmarks and made you eager to further your exploration. The 
following chapters are dedicated to the various subject areas of computer graphics. Each chapter begins 
with the necessary background information (e.g., context and terminology) and a summary account of the 
material to be discussed in subsequent sections. 

We strive to provide clear explanation and inter-subject continuity in our presentation. Illustrative 
examples are used freely to substantiate discussion on abstract concepts. While the primary mission of this 
book is to offer a relatively well-focused introduction to the fundamental theory and underlying 
technology, significant variations in such matters as basic definitions and implementation protocols are 
presented in order to have a reasonably broad coverage of the field. In addition, interesting applications are 
introduced as early as possible to highlight the usefulness of the graphics technology and to encourage 
those who are eager to engage in hands-on practice. 

Algorithms and programming examples are given in pseudo-code that resembles the C programming 
language, which shares similar syntax and basic constructs with other widely used languages such as C-H — J- 
and Java. We hope that the relative simplicity of the C-style code presents little grammatical difficulty and 
hence makes it easy for you to focus your attention on the technical substance of the code. 

There are numerous solved problems at the end of each chapter to help reinforce the theoretical 
discussion. Some of the problems represent computation steps that are omitted in the text and are 
particularly valuable for those looking for further details and additional explanation. Other problems may 
provide new information that supplements the main discussion in the text. 



CHAPTER 2 


Image Representation 


A digital image, or image for short, is composed of discrete pixels or picture elements. These pixels are 
arranged in a row-and-column fashion to form a rectangular picture area, sometimes referred to as a raster. 
Clearly the total number of pixels in an image is a function of the size of the image and the number of 
pixels per unit length (e.g. inch) in the horizontal as well as the vertical direction. This number of pixels per 
unit length is referred to as the resolution of the image. Thus a 3 x 2 inch image at a resolution of 300 
pixels per inch would have a total of 540,000 pixels. 

Frequently image size is given as the total number of pixels in the horizontal direction times the total 
number of pixels in the vertical direction (e.g., 512 x 512, 640 x 480, or 1024 x 768). Although this 
convention makes it relatively straightforward to gauge the total number of pixels in an image, it does not 
specify the size of the image or its resolution, as defined in the paragraph above. A 640 x 480 image would 
measure 6| inches by 5 inches when presented (e.g., displayed or printed) at 96 pixels per inch. On the 
other hand, it would measure 1.6 inches by 1.2 inches at 400 pixels per inch. 

The ratio of an image’s width to its height, measured in unit length or number of pixels, is referred to 
as its aspect ratio. Both a 2 x 2 inch image and a 512 x 512 image have an aspect ratio of 1/1, whereas 
both a 6 x 4 j inch image and a 1024 x 768 image have an aspect ratio of 4/3. 

Individual pixels in an image can be referenced by their coordinates. Typically the pixel at the lower 
left comer of an image is considered to be at the origin (0, 0) of a pixel coordinate system. Thus the pixel at 
the lower right comer of a 640 x 480 image would have coordinates (639,0), whereas the pixel at the 
upper right comer would have coordinates (639,479). 

The task of composing an image on a computer is essentially a matter of setting pixel values. The 
collective effects of the pixels taking on different color attributes give us what we see as a picture. In this 
chapter we first introduce the basics of the most prevailing color specification method in computer graphics 
(Sect. 2.1). We then discuss the representation of images using direct coding of pixel colors (Sect. 2.2) 
versus using the lookup-table approach (Sect. 2.3). Following a discussion of the working principles of two 
representative image presentation devices, the display monitor (Sect. 2.4) and the printer (Sect. 2.5), we 
examine image files as the primary means of image storage and transmission (Sect. 2.6). We then take a 
look at some of the most primitive graphics operations, which primarily deal with setting the color 
attributes of pixels (Sect. 2.7). Finally, to illustrate the construction of beautiful images directly in the 
discrete image space, we introduce the mathematical background and detail the algorithmic aspects of 
visualizing the Mandelbrot set (Sect. 2.8). 
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2.1 THE RGB COLOR MODEL 

Color is a complex, interdisciplinary subject spanning from physics to psychology. In this section we 
only introduce the basics of the most widely used color representation method in computer graphics. We 
will have additional discussion later in another chapter. 

Figure 2-1 shows a color coordinate system with three primary colors: R (red), G (green), and B 
(blue). Each primary color can take on an intensity value ranging from 0 (off— lowest) to 1 (on — highest). 
Mixing these three primary colors at different intensity levels produces a variety of colors. The collection 
of all the colors obtainable by such a linear combination of red, green, and blue forms the cube-shaped 
RGB color space. The comer of the RGB color cube that is at the origin of the coordinate system 
corresponds to black, whereas the comer of the cube that is diagonally opposite to the origin represents 
white. The diagonal line connecting black and white corresponds to all the gray colors between black and 
white. It is called the gray axis. 



Given this RGB color model an arbitrary color wit hi n the cubic color space can be specified by its 
color coordinates: (r, g, b). For example, we have (0, 0, 0) for black, (1,1, 1 ) for w'hite, (1,1,0) for yellow', 
etc. A gray color at (0.7, 0.7, 0.7) has an intensity hallway between one at (0.9, 0.9, 0.9) and one at 
(0.5, 0.5, 0.5). 

Color specification using the RGB model is an additive process. We begin with black and add on the 
appropriate primary components to yield a desired color. This closely matches the working principles of 
the display monitor (see Sect. 2.4). On the other hand, there is a complementary color model, called the 
CMY color model, that defines colors using a subtractive process, which closely matches the working 
principles of the printer (see Sect. 2.5). 

In the CMY model we begin with white and take away the appropriate primary components to yield a 
desired color. For example, if we subtract red from w'hite, w'hat remains consists of green and blue, which 
is cyan. Looking at tliis from another perspective, we can use the amount of cyan, the complementary color 
of red, to control the amount of red, which is equal to one minus the amount of cyan. Figure 2-2 show's a 
coordinate system using the three primaries’ complementary colors: C (cyan), M (magenta), and Y 
(yellow'). The corner of the CMY color cube that is at (0, 0, 0) corresponds to white, whereas the corner of 
the cube that is at (1, 1, 1) represents black (no red, no green, no blue). The following formulas summarize 
the conversion between the two color models: 
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2.2 DIRECT CODING 

Image representation is essentially the representation of pixel colors. Using direct coding we allocate a 
certain amount of storage space for each pixel to code its color. For example, we may allocate 3 bits for 
each pixel, with one bit for each primary color (see Fig. 2-3). This 3-bit representation allows each primary 
to vary independently between two intensity levels: 0 (off) or 1 (on). Hence each pixel can take on one of 
the eight colors that correspond to the comers of the RGB color cube. 


bit 1: r bit 2: g bit 3: b 

0 0 0 

0 0 I 

0 1 0 

0 1 I 

1 0 0 

1 0 1 

1 1 0 

1 1 t 


color name 

black 

blue 

green 

cyan 

red 

magenta 

yellow 

white 


Fig. 2-3 Direct coding of colors using 3 bits. 


A widely accepted industry standard uses 3 bytes, or 24 bits, per pixel, with one byte for each primary 
color. This way we allow each primary color to have 256 different intensity levels, corresponding to binary 
values from 00000000 to 11111111. Titus a pixel can take on a color from 256 x 256 x 256 or 16.7 
mill ion possible choices. This 24-bit format is commonly referred to as the true color representation, for 
the difference between two colors that differ by one intensity level in one or more of the primaries is 
virtually undetectable under normal viewing conditions. Hence a more precise representation involving 
more bits is of little use in terms of perceived color accuracy. 

A notable special case of direct coding is the representation of black-and-white (bilevel) and gray-scale 
images, where the three primaries always have the same value and hence need not be coded separately. A 
black-and-white image requires only one bit per pixel, with bit value 0 representing black and 1 
representing white. A gray-scale image is typically coded with 8 bits per pixel to allow a total of 256 
intensity or gray levels. 

Although this direct coding method features simplicity and has supported a variety of applications, we 
can see a relatively high demand for storage space when it comes to the 24-bit standard. For example, a 
1000 x 1000 true color image would take up three million bytes. Furthermore, even if every pixel in that 
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image had a different color, there would only be one million colors in the image. In many applications the 
number of colors that appear in any one particular image is much less. Therefore the 24-bit representation’s 
ability to have 16.7 million different colors appear simultaneously in a single image seems to be somewhat 
overkill. 


2.3 LOOKUP TABLE 

Image representation using a lookup table can be viewed as a compromise between our desire to have a 
lower storage requirement and our need to support a reasonably sufficient number of simultaneous colors. 
In this approach pixel values do not code colors directly. Instead, they are addresses or indices into a table 
of color values. The color of a particular pixel is determined by the color value in the table entry that the 
value of the pixel references. 

Figure 2-4 shows a lookup table with 256 entries. The entries have addresses 0 through 255. Each 
entry contains a 24-bit RGB color value. Pixel values are now 1 -byte, or 8-bit, quantities. The color of a 
pixel whose value is i, where 0 < /' < 255, is determined by the color value in the table entry whose 
address is /. This 24-bit 256-entry lookup table representation is often referred to as the 8-bit format. It 
reduces the storage requirement of a 1000 x 1000 image to one million bytes plus 768 bytes for tire color 
values in the lookup table. It allows 256 simultaneous colors that are chosen from 16.7 million possible 
colors. 


o 

t 

2 


I ~ I ► 

8-bit pixel value 


255 


Fig. 2-4 A 24-bit 256-entry lookup table. 


It is important to remember that, using the lookup table representation, an image is defined not only by 
its pixel values but also by the color values in the corresponding lookup table. Those color values form a 
color map for the image. 
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2.4 DISPLAY MONITOR 

Among the numerous types of image presentation or output devices that convert digitally represented 
images into visually perceivable pictures is the display or video monitor. 

We first take a look at the working principle of a monochromatic display monitor, which consists 
mainly of a cathode ray tube (CRT) along with related control circuits. The CRT is a vacuum glass tube 
with the display screen at one end and connectors to the control circuits at the other (see Fig. 2-5). Coated 
on the inside of the display screen is a special material, called phosphor, which emits fight for a period of 
time when hit by a beam of electrons. The color of the fight and the time period vary from one type of 
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Electron gun 



phosphor to another. The light given off by the phosphor dining exposure to the electron beam is known as 
fluorescence, the continuing glow given off after the beam is removed is known as phosphorescence, and 
the duration of phosphorescence is known as the phosphor’s persistence. 

Opposite to the phosphor-coated screen is an electron gun that is heated to send out electrons. The 
electrons are regulated by the control electrode and forced by the focusing electrode into a narrow beam 
striking the phosphor coating at small spots. When this electron beam passes through the horizontal and 
vertical deflection plates, it is bent or deflected by the electric fields between the plates. Tire horizontal 
plates control the beam to scan from left to right and retrace from right to left. The vertical plates control 
the beam to go from the first scan line at the top to the last scan line at the bottom and retrace from the 
bottom back to the top. These actions are synchronized by the control circuits so that the electron beam 
strikes each and every pixel position in a scan line by scan line fashion. As an alternative to this 
electrostatic deflection method, some CRTs use magnetic deflection coils mounted on the outside of the 
glass envelope to bend the electron beam with magnetic fields. 

The intensity of the light emitted by the phosphor coating is a function of the intensity of the electron 
beam. The control circuits shut off the electron beam during horizontal and vertical retraces. The intensity 
of the beam at a particular pixel position is determined by the intensity value of the corresponding pixel in 
the image being displayed. 

The image being displayed is stored in a dedicated system memory area that is often referred to as the 
frame buffer or refresh buffer. The control circuits associated with the frame buffer generate proper video 
signals for the display monitor. The frequency at which the content of the frame buffer is sent to the display 
monitor is called the refreshing rate, which is typically 60 times or frames per second (60 Hz) or higher. A 
determining factor here is die need to avoid flicker, which occurs at lower refreshing rates when our visual 
system is unable to integrate the light impulses from the phosphor dots into a steady picture. The 
persistence of the monitor’s phosphor, on the other hand, needs to be long enough for a frame to remain 
visible but short enough for it to fade before the next frame is displayed. 

Some monitors use a technique called interlacing to “double” their refreshing rate. In this case only 
half of the scan lines in a frame is refreshed at a time, first the odd numbered lines, then the even numbered 
litres. Thus the screen is refreshed from top to bottom in half the time it would have taken to sweep across 
all the scan lines. Although this approach does not really increase the rate at which the entire screen is 
refreshed, it is quite effective in reducing flicker. 
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Color Display 

Moving on to color displays there are now three electron guns instead of one inside the CRT (see Fig. 
2-6), with one electron gun for each primary color. The phosphor coating on the inside of the display 
screen consists of dot patterns of three different types of phosphors. These phosphors are capable of 
emitting red, green, and blue light, respectively. Tire distance between the center of the dot patterns is 
called the pitch of the color CRT. It places an upper limit on the number of addressable positions on the 
display area. A thin metal screen called a shadow mask is placed between the phosphor coating and the 
electron guns. The tiny holes on the shadow mask constrain each electron beam to hit its corresponding 
phosphor dots. When viewed at a certain distance, light emitted by tire three types of phosphors blends 
together to give us a broad range of colors. 


Electron guns 



2.5 PRINTER 

Another typical image presentation device is the printer. A printer deposits color pigments onto a print 
media, changing the light reflected from its surface and making it possible for us to see the print result. 

Given the fact that the most commonly used print media is a piece of white paper, we can in principle 
utilize three types of pigments (cyan, magenta, and yellow) to regulate tire amount of red, green, and blue 
light reflected to yield all RGB colors (see Sect. 2.1). However, in practice, an additional black pigment is 
often used due to the relatively high cost of color pigments and the technical difficulty associated with 
producing high-quality black from several color pigments. 

While some printing methods allow color pigments to blend together, in many cases the various color 
pigments remain separate in the form of tiny dots on the print media. Furthermore, the pigments are often 
deposited with a limited number of intensity levels. There are various techniques to achieve the effect of 
multiple intensity levels beyond what the pigment deposits can offer. Most of these techniques can also be 
adapted by the display devices that we have just discussed in the previous section. 


Halftoning 

Let’s first take a look at a traditional technique called halftoning from the printing industry for bilevel 
devices. This technique uses variably sized pigment dots that, when viewed from a certain distance, blend 
with the white background to give us the sensation of varying intensity levels. These dots are arranged in a 
pattern that fonns a 45 ° screen angle with the horizon (see Fig. 2-7 where the dots are enlarged for 
illustration). The size of the dots is inversely proportional to the intended intensity level. When viewed at a 
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far enough distance, the stripe in Fig. 2-7 exhibits a gradual shading from white (high intensity) on the left 
to black (low intensity) on the right. An image produced using this technique is called a halftone. In 
practice, newspaper halftones use 60 to 80 dots per inch, whereas book and magazine halftones use 120 to 
200 dots per inch. 
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Fig. 2-7 A halftone stripe. 


Halftone Approximation 

Instead of changing dot size we can approximate the halftone technique using pixel-grid patterns. For 
example, with a 2 x 2 bilevel pixel grid we can construct five grid patterns to produce five overall intensity 
levels (see Fig. 2-8). We can increase the number of overall intensity levels by increasing the size of the 
pixel grid (see the following paragraphs for an example). On the other hand, if the pixels can be set to 
multiple intensity levels, even a 2 x 2 grid can produce a relatively high number of overall intensity levels. 
For example, if the pixels can be intensified to four different levels, we can follow the pattern sequence in 
Fig. 2-8 to bring each pixel from one intensity level to the next to approximate a total of thirteen overall 
intensity levels (one for all pixels off and four for each of the three non-zero intensities, see Fig. 2-9). 


H H a B 



Fig. 2-8 Halftone approximation. 
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Fig. 2-9 Halftone approximation with 13 intensity levels. 


These halftone grid patterns are sometimes referred to as dither patterns. There are several 
considerations in the design of dither patterns. First, the pixels should be intensified in a growlh-from- 
the-grid-center fashion in order to mimic the growth of dot size. Second, a pixel that is intensified to a 
certain level to approximate a particular overall intensity should remain at least at that level for all 
subsequent overall intensity levels. In other words, the patterns should evolve from one to the next in order 
to minimize the differences in the patterns for successive overall intensity levels. Third, symmetry should 
be avoided in order to minimize visual artifacts such as streaks that would show up in image areas of 
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uniform intensity. Fourth, isolated “on” pixels should be avoided since they are sometimes hard to 
reproduce. 

We can use a dither matrix to represent a series of dither patterns. For example, the following 3x3 
matrix: 

5 2 7 
I 0 3 

6 8 4 

represents the order in which pixels in a 3 x 3 grid are to be intensified. For bilevel reproduction, this gives 
us ten intensity levels from level 0 to level 9, and intensity level I is achieved by turning on all pixels that 
correspond to values in the dither matrix that are less than I. If each pixel can be intensified to three 
different levels, we follow the order defined by the matrix to set the pixels to their middle intensity level 
and then to their high intensity level to approximate a total of nineteen overall intensity levels. 

This halftone approximation technique is readily applicable to the reproduction of color images. All 
we need is to replace the dot at each pixel position with an RGB or CMY dot pattern (e.g., the triad pattern 
shown in Fig. 2-6). If we use a 2 x 2 pixel grid and each primary or its complement can take on two 
intensity levels, we achieve a total of 5 x 5 x 5 = 125 color combinations. 

At this point we can turn to the fact that halftone approximation is a technique that trades spatial 
resolution for more colors/intensity levels. For a device that is capable of producing images at a resolution 
of 400 x 400 pixels per inch, halftone approximation using 2x2 dither patterns would mean lowering its 
resolution effectively to 200 x 200 pixels per inch. 


Dithering 

A technique called dithering can be used to approximate halftones without reducing spatial resolution. 
In this approach the dither matrix is treated very much like a floor tile that can be repeatedly positioned one 
copy next to another to cover the entire floor, i.e., the image. A pixel at (x, y) is intensified if the intensity 
level of the image at that position is greater than the corresponding value in the dither matrix. 
Mathematically, if D„ stands for an n x n dither matrix, the element D n (i,j) that corresponds to pixel 
position (x,y) can be found by i = x mod n and j =y mod n. For example, if we use the 3x3 matrix 
given earlier for a bilevel reproduction and the pixel of the image at position (2, 19) has intensity level 5, 
then the corresponding matrix element is £> 3 (2, 1) = 3, and hence a dot should be printed or displayed at 
that location. 

It should be noted that, for image areas that have constant intensity, the results of dithering are exactly 
the same as the results of halftone approximation. Reproduction differences between these two methods 
occur only when intensity varies. 


Error Diffusion 

Another technique for continuous-tone reproduction without sacrificing spatial resolution is called the 
Floyd Steinberg error diffusion. Here a pixel is printed using the closest intensity the device can deliver. 
The error term, i.e., the difference between the exact pixel value and the approximated value in the 
reproduction, is then propagated to several yet-to-be-processed neighboring pixels for compensation. More 
specifically, let S be the source image that is processed in a leff-to-right and top-to-bottom pixel order, 
S{x, y) be the pixel value at location (x, y), and e be S(x, y) minus Hie approximated value. We update the 
value of the pixel’s four neighbors (one to its right and three in the next scan line) as follows: 

S(x + 1 , y) = S(x + 1 , y) + ae 
S(x — 1 ,y — 1) = S(x — l,y — 1) + be 
S(x, y — 1 ) = S(x, y — 1 ) + ce 
S(x+ l,y — 1) = 5(x+ l,y — 1) + de 
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where parameters a through d often take values and yg, respectively. These modifications are for 

the purpose of using the neighboring pixels to offset the reproduction error at the current pixel location. 
They are not permanent changes made to the original image. 

Consider, for example, the reproduction of a gray scale image (0: black, 255: white) on a bilevel device 
( level 0: black, level 1 : white), if a pixel whose current value is 96 has just been mapped to level 0, we have 
e — 96 for this pixel location. The value of the pixel to its right is now increased by 96 x yg = 42 in order 
to determine the appropriate reproduction level. This increment tends to cause such neighboring pixel to be 
reproduced at a higher intensity level, partially compensating the discrepancy brought on by mapping 
value 96 to level 0 (which is lower than the actual pixel value) at the current location. The other three 
neighboring pixels (one below and to the left, one immediately below, and one below and to the right) 
receive 1 8, 30, and 6 as their share of the reproduction error at the current location, respectively. 

Results produced by this error diffusion algorithm are generally satisfactory, with occasional 
introduction of slight echoing of certain image parts. Improved performance can sometimes be obtained 
by alternating scanning direction between left-to-right and right-to-left (minor modifications need to be 
made to the above formulas). 


2.6 IMAGE FILES 

A digital image is often encoded in the fonn of a binary file for the puipose of storage and 
transmission. Among the numerous encoding formats are BMP (Windows Bitmap), JPEG (Joint 
Photographic Experts Group File Interchange Format), and TIFF (Tagged Image File Format). Although 
these formats differ in technical details, they share structural similarities. 

Figure 2- 1 0 shows the typical organization of information encoded in an image file. Tlie file consists 
largely of two parts: header and image data. In the beginning of the file header a binary code or ASCII 
string identifies the format being used, possibly along with the version number. The width and height of the 
image are given in numbers of pixels. Common image types include black and white ( 1 bit per pixel ), 8-bit 
gray scale (256 levels along the gray axis), 8-bit color (lookup table), and 24-bit color. Image data format 
specifies the order in which pixel values are stored in the image data section. A commonly used order is left 
to right and top to bottom. Another possible order is left to right and bottom to top. Image data fonnat also 
specifies if the RGB values in the color map or in the image are interlaced. When the values are given in an 
interlaced fashion, the three primary color components for a particular lookup table entry or a particular 
pixel stay together consecutively, followed by the three color components for the next entry or pixel. Thus 
the color values in the image data section are a sequence of red, green, blue, red, green, blue, etc. When the 
values arc given in a non-interlaced fashion, the values of one primary for all table entries or pixels appear 


Format 'version identification 

Image width aud height in pixels 

Image type 

Image data format 

Compression type 

etc. 

\ 

‘x 

Header 

't 

Color map (if any) 

Pixel values 

/ 

3 

\ 

Image data 


Fig. 2-10 Typical image file fonnat. 
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first, then the values of another primary, followed by the values of the third primary. Thus the image data 
are in the form of red, red green, green, . . . , blue, blue 

The values in the image data section may be compressed, using such compression algorithms as run- 
length encoding (RLE). The basic idea behind RLE can be illustrated with a character string 
“xxxxxxyyzzzz”, which takes 12 bytes of storage. Now if we scan the string from left to right for 
segments of repeating characters and replace each segment by a 1-byte repeat count followed by the 
character being repeated, we convert or compress the given string to “6x2y4z”, which takes only 6 bytes. 
This compressed version can be expanded or decompressed by repeating the character following each 
repeat count to recover the original string. 

The length of the file header is often fixed, for otherwise it would be necessary to include length 
information in the header to indicate where image data starts (some formats include header length anyway). 
The length of each individual component in the image data section is, on the other hand, dependent on such 
factors as image type and compression type. Such information, along with additional format-specific 
information, can also be found in the header. 


2.7 SETTING THE COLOR ATTRIBUTES OF PIXELS 

Setting the color attributes of individual pixels is arguably the most primitive graphics operation. It is 
typically done by making system library calls to write the respective values into the frame buffer. An 
aggregate data structure, such as a three-element array, is often used to represent the three primary color 
components. Regardless of image type (direct coding versus lookup table), there are two possible protocols 
for the specification of pixel coordinates and color values. 

In one protocol the application provides both coordinate information and color information 
simultaneously. Thus a call to set the pixel at location (.v, v) in a 24-bit image to color (r, g, b) would 
look like 

setPixel(.v, v, rgb) 

where rgb is a three-element array with rgZ>[0] = r, rgh[ 1 J = g, and rgb[2] = b. On the other hand, if the 
image uses a lookup table then, assuming that the color is defined in the table, the call would look like 

setPixel(x, y, i) 

where i is the address of the entry containing (r, g, b). 

Another protocol is based on the existence of a current color, which is maintained by the system and 
can be set by calls that look like 

setColor(/*g6) 

for direct coding, or 

setColor(Z) 

for the lookup table representation. Calls to set pixels now need only to provide coordinate information and 
would look like 

setPixel(.r,v) 

for both image types. The graphics system will automatically use the most recently specified current color 
to carry out the operation. 

Lookup table entries can be set from the application by a call that looks like 
setEntryf/, rgb) 

which puts color (r, g, b) in the entry whose address is i. Conversely, values in the lookup table can be read 
back to the application with a call that looks like 

getEntry(/, rgb) 

which returns the color value in entry / through array parameter rgb. 
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There are sometimes two versions of the calls that specify RGB values. One takes RGB values as 
floating point numbers in the range of [0.0, 1 .0], whereas the other takes them as integers in the range of 
[0,255]. Although the floating point version is handy when the color values come from some continuous 
formula, the floating point values are mapped by the graphics system into integer values before being 
written into the frame buffer. 

In order to provide basic support for pixel-based image-processing operations there are calls that look 
like 


gelPixcl(x, v. rgb) 
for direct coding or 
getPixelfx, y, i) 

for the lookup table representation to return the color or index value of the pixel at (x, y) back to the 
application. 

There are also calls that read and write rectangular blocks of pixels. A useful example would be a call 
to set all pixels to a certain background color. Assuming that the system uses a current color we would first 
set the current color to be the desired background color, and then make a call that looks like: 

clearf) 

to achieve the goal. 


2.8 EXAMPLE: VISUALIZING THE MANDELBROT SET 

An elegant and illustrative example showing the construction of beautiful images by setting the color 
attributes of individual pixels directly from the application is the visualization of the Mandelbrot set. This 
remarkable set is based on the following transformation: 

* n+ i =-^+z 

where both x and z represent complex numbers. For readers who are unfamiliar with complex numbers it 
suffices to know that a complex number is defined in the form of a + bi. Here both a and b are real 
numbers; a is called the real part of the complex number and b the imaginary part (identified by the special 
symbol /). The magnitude of a + bi, denoted by | a -)- bi |, is equal to the square root of a 2 + b 2 . The sum of 
two complex numbers a + bi and c + di is defined to be (a 4- c) + (b 4- d)i. The product of a + bi and 
c + di is defined to be ( ac — bd ) + (ad + bc)i. Thus the square of a + bi is equal to (a 2 — b 2 ) + 2 abi. For 
example, the sum of 0.5 + 2.0/ and 1.0 — 1.0/ is 1.5 + 1.0/. The product of the two is 2.5 + 1.5/. The 
square of 0.5 + 2.0/ is —3.75 + 2.0/ and the square of 1.0— 1.0/ is 0.0 — 2.0/. 

The Mandelbrot set is the set of complex numbers z that do not diverge under the above transformation 
with x 0 = 0 (both the real and imaginary parts of x 0 are 0). In other words, to determine if a particular 
complex number z is a member of the set, we begin with x 0 = 0, followed by x, = .fy + z, 
x 2 = Xj + z x n+1 = x£ + z, If |x| goes towards infinity when n increases, then z is not a mem- 

ber. Otherwise, z belongs to the Mandelbrot set. 

Figure 2-1 1 shows how to produce a discrete snapshot of the Mandelbrot set. On the left hand side is 
the complex plane where the horizontal axis Re measures the real part of complex numbers and the vertical 
axis Im measures the imaginary part. Hence an arbitrary complex number z corresponds to a point in the 
complex plane. Our goal is to produce an image of width by height (in numbers of pixels) that depicts the z 
values in a rectangular area defined by (Re_min, Im_min) and (Re_max, Im_max). This rectangular area 
has the same aspect ratio as the image so as not to introduce geometric distortion. We subdivide the area to 
match the pixel grid in the image. The color of a pixel, shown as a little square in the pixel grid, is 
determined by the complex number z that corresponds to the lower left comer of the little square. Although 
only width x height points in the complex plane are used to compute the image, this relatively 
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Fig. 2-1 1 Visualizing the Mandelbrot set. 


straightforward approach to discrete sampling produces reasonably good approximations for the purpose of 
visualizing the set. 

There are many ways to decide the color of a pixel based on the corresponding complex number z. 
What we do here is to produce a gray scale image where the gray level of a non-black pixel represents 
proportionally the number of iterations it takes for |x| to be greater than 2. We use 2 as a threshold for 
divergence because x diverges quickly under the given transformation once |x| becomes greater than 2. If 
|x| remains less than or equal to 2 after a preset maximum number of iterations, we simply set the pixel 
value to 0 (black). 

The following pseudo-code implements what we have discussed in the above paragraphs. We use N to 
represent the maximum number of iterations, z.real the real part of z, and z.imag the imaginary part of z. 
We also assume a 256-entry gray scale lookup table where the color value in entry / is (/, i, i). The formula 
in the second call to setColor is to obtain a proportional mapping from [0. N] to [1,255]: 

int i,j, count; 

float delta = (Re_max — Re_min)/width; 

for (i = 0, z.real = Re_min; i < width; i ++ , z.real+ = delta) 
for (_/ = 0, z.imag = Im_min; j < height; y'++, z.imag + = delta) { 
count = 0; 

complex number x = 0; 
while ( |-y| <= 2.0 && count < N) { 
compute .v = x 2 + z; 
couut++; 

} 

if (|x| <= 2.0) setColor(O); 
else setColor) 1 + 254*count/A / ); 
setPixel(/,y); 

> 

The image in Fig. 2-12 shows what is nicknamed the Mandelbrot bug. It visualizes an area where 
—2.0 < z. real < 0.5 and —1.25 < z. imag < 1.25 with N — 64. Most z values that are outside the area 
lead x to diverge quickly, whereas the z values in the black region belong to the Mandelbrot set. It is along 
the contour of the bug-like figure that we see the most dynamic alterations between divergence and non- 
divergence, together with the most significant variations in the number of iterations used in the divergence 
test. The brighter a pixel, the longer it took to conclude divergence for the corresponding z. In principle the 
rectangular area can be reduced indefinitely to zoom in on any active region to show more intricate details. 
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Fig. 2-12 The Mandelbrot set 


Julia Sets 

Now if we set z to some fixed non-zero value and vary x 0 across the complex plane, we obtain a set of 
non -divergence numbers (values of ,v 0 that do not diverge under the given transformation) that form a Julia 
set. Different z values lead to different Julia sets. The image in Fig. 2-13 is produced by making slight 
modifications to the pseudo-code for the Mandelbrot set. It shows the Julia set defined by 
z — —0.74543 + 0.11301/ with —1.2 < .v 0 .real < 1.2, —1.2 <x 0 .imag < 1.2, and N = 128. 



Fig. 2-13 A Julia set. 


Solved Problems 


2.1 What is the resolution of an image? 

SOLUTION 

The number of pixels (i.e., picture elements) per unit length (e.g., inch) in the horizontal as well as 
vertical direction. 


2.2 Compute the size of a 640 x 480 image at 240 pixels per inch. 

SOLUTION 


640/240 by 480/240 or 2 f by 2 inches. 
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2.3 Compute the resolution of a 2 x 2 inch image that has 512 x 512 pixels. 

SOLUTION 

512/2 or 256 pixels per inch. 

2.4 What is an image’s aspect ratio? 

SOLUTION 

The ratio of its width to its height, measured in unit length or number of pixels. 

2.5 If an image has a height of 2 inches and an aspect ratio of 1.5, what is its width? 

SOLUTION 

width = 1 .5 x height = 1 .5 x 2 = 3 inches. 

2.6 If we want to resize a 1024 x 768 image to one that is 640 pixels wide with the same aspect ratio, 
what would be the height of the resized image? 

SOLUTION 

height = 640 x 768/1024 = 480. 

2.7 If we want to cut a 512 x 512 sub-image out from the center of an 800 x 600 image, what are the 
coordinates of the pixel in the large image that is at the lower left comer of the small image? 

SOLUTION 

[(800 - 5 1 1)12, (600-51 2)/2] =(144, 44). 

2.8 Sometimes the pixel at the upper left comer of an image is considered to be at the origin of the pixel 
coordinate system (a left-handed system). How to convert the coordinates of a pixel at (x, y) in this 
coordinate system into its coordinates (x\ y') in the lower-left-comer-as-origin coordinate system (a 
right-handed system)? 

SOLUTION 

(x', y) = (x. m — y — 1) where m is the number of pixels in the vertical direction. 

2.9 Find the CMY coordinates of a color at (0.2, 1,0.5) in the RGB space. 

SOLUTION 

(1 -0.2, 1-1,1 -0.5) = (0.8.0, 0.5). 

2.10 Find the RGB coordinates of a color at (0.15,0.75,0) in the CMY space. 

SOLUTION 

(1 - 0.15, 1 - 0.75, 1 - 0) = (0.85, 0.25, 1). 

2.11 If we use direct coding of RGB values with 2 bits per primary color, how many possible colors do 
we have for each pixel? 



20 IMAGE REPRESENTATION [CHAP. 2 

SOLUTION 

2 2 x 2 2 x 2 2 = 4 x 4 x 4 = 64. 

2.12 If we use direct coding of RGB values with 10 bits per primary color, how many possible colors do 
we have for each pixel? 

SOLUTION 

2 10 x 2 10 x 2 1U = 1 024 3 = 1073.741.824 > 1 billion. 

2.13 The direct coding method is flexible in that it allows the allocation of a different number of bits to 
each primary color. If we use 5 bits each for red and blue and 6 bits for green for a total of 16 bits 
per pixel, how many possible simultaneous colors do we have? 

SOLUTION 

2 5 x 2 5 x 2 6 = 2 16 = 65.536. 

2.14 If we use 12-bit pixel values in a lookup table representation, how many entries does the lookup 
table have? 

SOLUTION 

2 12 = 4096. 

2.15 If we use 2-byte pixel values in a 24-bit lookup table representation, how many bytes does tire 
lookup table occupy? 

SOLUTION 

2 16 x 24/8 = 65.536 x 3 = 196,608. 

2.16 True or false: fluorescence is the term used to describe the light given off by a phosphor after it has 
been exposed to an electron beam. Explain your answer. 

SOLUTION 

False. Phosphorescence is the correct term. Fluorescence refers to the light given off by a phosphor while 
it is being exposed to an electron beam. 

2.17 What is persistence? 

SOLUTION 

The duration of phosphorescence exhibited by a phosphor. 

2.18 What is the function of the control electrode in a CRT? 

SOLUTION 

Regulate the intensity of the electron beam. 

2.19 Name the two methods by which an electron beam can be bent? 

SOLUTION 


Electrostatic deflection and magnetic deflection. 
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2.20 What do you call the path the electron beam takes when returning to the left side of the CRT screen? 
SOLUTION 

Horizontal retrace. 

2.21 What do you call the path the electron beam takes at the end of each refresh cycle? 

SOLUTION 

Vertical retrace. 

2.22 What is the pitch of a color CRT? 

SOLUTION 

The distance between the center of the phosphor dot patterns on the inside of the display screen. 

2.23 Why do many color printers use black pigment? 

SOLUTION 

Color pigments (cyan, magenta, and yellow) are relatively more expensive and it is technically difficult to 
produce high-quality black using several color pigments. 

2.24 Show' that with an n x n pixel grid, where each pixel can take on m intensity levels, w'e can 
approximate n x n x (»? — 1) + 1 overall intensity levels. 

SOLUTION 

Since the n x ;/ pixels can be set to a non-zero intensity value one after another to produce n x n overall 
intensity levels, and there are m — 1 non-zero intensity levels for the individual pixels, w'e can approximate a 
total of n x n x (m — 1) non-zero overall intensity levels. Finally we need to add one more overall intensity 
level that corresponds to zero intensity (all pixels oft). 

2.25 Represent the grid patterns in Fig. 2-8 with a dither matrix. 

SOLUTION 

a o 

2.26 What are the error propagation formulas for a top-to-bottom and right-to-left scanning order in the 
Floyd Steinberg error diffusion algorithm? 

SOLUTION 

S(x — 1 , y) = S(x — 1 , y) + ae 
S(x+ l,y— 1) = S(x + l,y— 1 ) + be 
S(x. y — l ) = S(x, y — 1 ) 4- ce 
Six — 1 ,y — 1) = S(x — 1 ,y — 1 ) + de 

2.27 What is RLE? 

SOLUTION 

RLE stands for run-length encoding, a technique tor image data compression. 
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2.28 Follow the illustrative example in the text to reconstruct the string that has been compressed to 
“981435” using RLE. 

SOLUTION 

“8888888884555” 


2.29 If an 8-bit gray scale image is stored uncompressed in sequential memory or in an image file in left- 
to-right and bottom-to-top pixel order, what is the offset or displacement of the byte for the pixel at 
(,v. y) from the beginning of the memory segment or the file’s image data section? 

SOLUTION 

offset = y x n + x where it is the number of pixels in the horizontal direction. 

2.30 What if the image in Prob. 2.29 is stored in left-to-right and top-to-bottom order? 

SOLUTION 

offset = (m —y — I)« +x where « and m are the number of pixels in the horizontal and vertical 
direction, respectively. 


2.31 Develop a pseudo-code segment to initialize a 24-bit 256-entry lookup table with gray-scale values. 

SOLUTION 

int i, ygf>[3]; 

for (i = 0: i < 256: i++) { 
rgb[Q] = rgb[ I] = rgb[ 2] = i; 
setEntrvf/, rgb): 

} 


2.32 Develop a pseudo-code segment to swap the red and green components of all colors in a 256-entry 
lookup table. 

SOLUTION 

int i,x, rgb[3]; 
for (i = 0; i < 256: /++) { 
getEntryf/', rgb); 
x = rgb[0]; 
rg6[0] = rg/>[l]: 

rgb[ 1] = Jc; 

setEntryp, rgb); 

} 

2.33 Develop a pseudo-code segment to draw a rectangular area of w x h ( in number of pixels) that starts 
at (x, y) using color rgb. 

SOLUTION 

int 

setColoi irgb); 

for (j = v; j < y + h; /++) 

for (/ = x; i < x + w; /++) setPixel(/,/); 
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2.34 Develop a pseudo-code segment to draw a triangular area with the three vertices at (x,y), (.v. v + t), 
and (x -(- 1, y), where integer / > 0, using color rgb. 

SOLUTION 

int i,j; 

setC'olop/'g/p; 

for (j = v; j <= v + /; j++) 

for (/ = x; i <= x + y + t—j ; /++) setPixel(/,_/); 


2.35 Develop a pseudo-code segment to reset every pixel in an image that is in the 24-bit 256-entry 
lookup table representation to its complementary color. 

SOLUTION 

int /, r«/>[3j; 

for (/ = 0; / < 256; /++) { 
getEntry(/\ rgb); 
rg/>[0] = 255 - rg6[0]; 
rg6[lj =255 -rgi[lj; 
rgb[ 2] = 255 - rgb[2}; 
setEntry(/, rgb); 

} 


2.36 Wliat if the image in Prob. 2.35 is in the 24-bit true color representation? 
SOLUTION 
int i. j, rg/>[3]; 

for ( j = 0 ;j< height; /++) 
for (/ = 0: i < width; /++) { 
getPixel(/, /, rgb); 
rg/)[0] = 255 - rg/»[0); 

'■g6[lj = 255 - rg6[l]; 
rgb[ 2] = 255 - rgb[2]; 
setPixel(i. /, rgb); 

} 


2.37 Calculate the sum and product of 0.5 + 2.0/ and 1.0 — 1.0/. 

SOLUTION 

(0.5 + 1.0) + (2.0 + (- 1 .0))/ =1.5+1 .0/ 

(0.5 x 1.0 - 2.0 x (-1.0)) + (0.5 x (-1.0) + 2.0 x 1.0)/ = 2.5 + 1.5/ 


2.38 Calculate the square of the two complex numbers in Prob. 2.37. 

SOLUTION 

(0.5 2 - 2.0 2 ) + 2 x 0.5 x 2.0/ = -3.75 + 2.0/ 

(1.0 2 — (— 1.0) 2 ) + 2 x 1.0 x(- 1.0)/ = 0.0 -2.0/ 


2.39 Show that 1 +254 x count/.V provides a proportional mapping from count in [0, /V] to c in 
[1,255], 
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SOLUTION 

Proportional mapping means that we want 

(c - 1 )/(255 - I ) = (count - 0 )/(N - 0) 
Hence c = 1 4- 254 x count /A. 


2.40 Modify the pseudo code for visualizing the Mandelbrot set to visualize the Julia sets. 

SOLUTION 

int i,j, count; 

float delta = (Re_max — Re_min)/width; 
for (/ = 0, .v.real = Re_min; i < width; i++, _v.real+ = delta) 
for (j = 0, x.imag = Im_min; j < height; /'++, x.imag+ = delta) { 
count = 0; 

while (|x| < 2.0 && count < N ) { 
compute x = x 2 + z; 
count++; 

} 

if (|x| < 2.0) setColortO); 

else setColor(l + 254*countW); 

setPixel(/,7); 

1 

2.41 How to avoid the calculation of square root in an actual implementation of the algorithms for 
visualizing the Mandelbrot and Julia sets? 

SOLUTION 

Test for |x| 2 < 4.0 instead of |x| < 2.0. 


Supplementary Problems 

2.42 Can a 5 by 3 j inch image be presented at 6 by 4 inch without introducing geometric distortion? 

2.43 Retering to Prob. 2.42, what if the original is by 3 ’ inch? 

2.44 Given the portrait image of a person, describe a simple way to make the person look more slender. 


2.45 An RGB color image can be converted to a gray-scale image using the formula 0.299R 4- 0.587G -t- 0. 1 1 4B 
for gray levels (see Chap. 11, Sec. 11.1 under “The NTSC YIQ Color Model”). Assuming that 
getPixelfv, y. rgb) now r reads pixel values from a 24-bit input image and setPixel(x, v. i) assigns pixel 
values to an output image that uses a gray-scale lookup table, develop a pseudo-code segment to convert 
the input image to a gray-scale output image. 




Scan Conversion 


Mauy pictures, from 2D drawings to projected views of 3D objects, consist of graphical primitives 
such as points, lines, circles, and filled polygons. These picture components are often defined in a 
continuous space at a higher level of abstraction than individual pixels in the discrete image space. For 
instance, a line is defined by its two endpoints and the line equation, whereas a circle is defined by its 
radius, center position, and the circle equation. It is the responsibility of the graphics system or the 
application program to convert each primitive from its geometric definition into a set of pixels that make up 
the primitive in the image space. This conversion task is generally referred to as scan conversion or 
rasterization. 

The focus of this chapter is on the mathematical and algorithmic aspects of scan conversion. Wc 
discuss ways to handle several commonly encountered primitives including points, lines, circles, ellipses, 
characters, and filled regions in an efficient and effective manner. We also discuss techniques that help to 
“smooth out" the discrepancies between the original element and its discrete approximation. The 
implementation of these algorithms and mathematical solutions (and many others in subsequent chapters) 
varies from one system to another and can be in the fonn of various combinations of hardware, firmware, 
and software. 


3.1 SCAN-CONVERTING A POINT 

A mathematical point ft, y) where x and y are real numbers within an image area, needs to be scan- 
converted to a pixel at location (pc', /). This may be done by making x to be the integer part oft, and / the 
integer part of y. In other words, x' = Floor(t) and V — Floor( v), where function Floor returns the largest 
integer that is less than or equal to the argument. Doing so in essence places the origin of a continuous 
coordinate system for (t, v) at the lower left comer of the pixel grid in the image space [see Fig. 3-1 («)]. All 
points that satisfy x / < x < x' + I and V < y < y + 1 are mapped to pixel (x',y'). For example, point P ] 

( 1 .7, 0.8) is represented by pixel (1,0). Points P-, (2.2, 1 .3) and P, (2.8, 1 .9) are both represented by pixel 
(2,1). 

Another approach is to align the integer values in the coordinate system for (x,y) with the pixel 
coordinates [see Fig. 3-1(6)]. Here we scan convert (x, v) by making x' = Floor(x+ 0.5) and V — 
Floor(v + 0.5). This essentially places the origin of the coordinate system for (x, v) at the center of 
pixel (0, 0). All points that satisfy x! — 0.5 < x < x' + 0.5 and y — 0.5 < y < V + 0.5 are mapped to pixel 
(x\/). This means that points P x and P 2 are now both represented by pixel (2, 1), whereas point P 3 is 
represented by pixel (3,2). 
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Fig. 3-1 Scan-converting points. 

We will assume, in the following sections, that this second approach to coordinate system alignment is 
used. Thus all pixels are centered at the integer values of a continuous coordinate system where abstract 
graphical primitives are defined. 


3.2 SCAN-CONVERTING A LINE 

A line in computer graphics typically refers to a line segment, which is a portion of a straight line that 
extends indefinitely in opposite directions. It is defined by its two endpoints and the line equation 
y — nix + b, where m is called the slope and b the y intercept of the line. In Fig. 3-2 the two endpoints are 
described by and P 2 (x 2 , >'2 )• Tire line equation describes the coordinates of all the points that lie 

between the two endpoints. 



A note of caution: this slope intercept equation is not suitable for vertical lines. Horizontal, vertical, 
and diagonal (|m| — 1) lines can, and often should, be handled as special cases without going through the 
following scan-conversion algorithms. These commonly used lines can be mapped to the image space in a 
straightforward fashion for high execution efficiency. 


Direct Use of the Line Equation 

A simple approach to scan-converting a line is to first scan-convert P x and P 2 to pixel coordinates 
(*i,y,) and (x > 2 , >2 ), respectively; then set m = (y' 2 — y r \)/{x’ 2 — Xj) and b — y 2 — mx\ . If |m| < 1, then for 
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every integer value of x between and excluding x', and x J 2 , calculate the corresponding value of y using the 
equation and scan-convert (.r,y). If \m\ > 1, then for every integer value of y between and excluding v\ and 
y 2 , calculate the corresponding value of x using the equation and scan-convert (x,y). 

While this approach is mathematically sound, it involves floating-point computation (multiplication 
and addition) in every step that uses the line equation since m and b are generally real numbers. The 
challenge is to find a way to achieve the same goal as quickly as possible. 


DDA Algorithm 

The digital differential analyzer (DDA) algorithm is an incremental scan-conversion method. Such an 
approach is characterized by perfonning calculations at each step using results from the preceding step. 
Suppose at step ; we have calculated (x, , y,. ) to be a point on the line. Since the next point (x, +1 , _v, + i ) should 
satisfy Ay/ Ax -- m where Ay = y /+1 — y, and Ax = x i+1 — x„ we have 

v,+ i = v, + mAx 
or 

x, +1 = x, + Ay/m 

These formulas are used in the DDA algorithm as follows. When \m\ < I , we start with x = Xj 
(assuming that x' l < x' 2 ) and y=yj, and set Ax = 1 (i.e., unit increment in the x direction). The y 
coordinate of each successive point on the line is calculated using y i+] =y,- + m. Wlten |w| > 1, we start 
with x = x'j and y = y', (assuming that y' l < yV), and set Ay = 1 (i.e., unit increment in the y direction). The 
x coordinate of each successive point on the line is calculated using x, +1 = x, + 1 /in. This process 
continues until x reaches x t 2 (for the |»?| < 1 case) or y reaches y? (for the |m| > 1 case) and all points 
found are scan-converted to pixel coordinates. 

The DDA algorithm is faster than the direct use of the line equation since it calculates points on the 
line without any floating-point multiplication. However, a floating-point addition is still needed in 
determining each successive point. Furthermore, cumulative error due to limited precision in the 
floating-point representation may cause calculated points to drift away from their true position when the 
line is relatively long. 


Bresenham’s Line Algorithm 

Bresenham’s line algorithm is a highly efficient incremental method for scan-converting lines. It 
produces mathematically accurate results using only integer addition, subtraction, and multiplication by 2, 
which can be accomplished by a simple arithmetic shift operation. 

The method works as follows. Assume that we want to scan-convert the line shown in Fig. 3-3 where 
0 < m < 1. We start with pixel P\ {x\ , y, ), then select subsequent pixels as we work our way to the right, 
one pixel position at a time in the horizontal direction towards /^(x',,/,). Once a pixel is chosen at any 
step, the next pixel is either the one to its right (which constitutes a lower bound for the line) or the one to 
its right and up (which constitutes an upper bound for the line) due to the limit on m. The line is best 
approximated by those pixels that fall the least distance from its true path between P\ and P’ 2 . 

Using the notation of Fig. 3-3, the coordinates of the last chosen pixel upon entering step i are (x,-,y f ). 
Our task is to choose the next one between the bottom pixel S and the top pixel T. If S is chosen, we have 
x J+1 — x ; + 1 and y i+1 =y t . If T is chosen, we have x /+1 = x, + 1 and y i+] =y i + 1. The actual y 
coordinate of the line at x — x i+1 is y - mx J+1 + b — m(Xj + 1) + b. The distance from S to the actual line 
in the y direction is s — y — y, . The distance from T to the actual line in the y direction is i = (y, + 1 ) — y. 
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Fig. 3-3 Scan-converting a line. 


Now consider the difference between these two distance values: s — t. When 5 — t is less than zero, we 
have s < t and the closest pixel is S. Conversely, when s — t is greater than zero, we have s > t and the 
closest pixel is T. We also choose T when s — / is equal to zero. This difference is 

s-t = (y-y,) - [(Vi + 1) - y] 

— 2y - 2y ( - - 1 = 2m(x i + 1) + 2b - 2y,- - 1 

Substituting m by Ay/ Ax and introducing a decision variable d- : — Ax(s — /), which has the same sign as 
(s — t) since Ax is positive in our case, we have 

cl, = 2Ay*x ( - - 2Ax*y, + C where C — 2 A y + Av( 2b — 1 ) 

Similarly, we can write the decision variable d,+\ for the next step as 

d i+l = 2Ay*x, +) - 2Ax*v, + 1 + C 

Tlien 


d i+ 1 - d t = 2Ar(x )+1 - x,) - 2Ax(v i+1 -y ( ) 

Since x /+1 = x,- + 1, we have 

d i+ 1 = d, + 2 Ay - 2Ax(y i+1 - y,) 

If the chosen pixel is the top pixel T (meaning that d, > 0) then y, +1 =y t + 1 and so 

d i+ 1 — dj + 2(Ay - Ax) 

On the other hand, if the chosen pixel is the bottom pixel S (meaning that <■/, < 0) then y i+1 =y f and so 

dj^\ — di + 2 Ay 

Hence we have 


I d, + 2(Ay - Ax) if d, > 0 
di+l -(</,. + 2 Ay if d, < 0 

Finally, we calculate d } , the base case value for this recursive formula, from the original definition of the 
decision variable d{, 

d\ = Ax[2/w(X] + 1) + 2 b — 2y! — 1] 

= Ax[2(wx! + b — y, ) + 2m — 1] 
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Since mx l + b — y l = 0, we have 


d l — 2A v — Ax 

In summary. Bresenham’s algorithm for scan-converting a line from P\ (x\ , /, ) to P 2 {x J 2 ,y 2 ) with 
x', < x > 2 and 0 < m < 1 can be stated as follows: 

int .v = Vj , y = y\ : 

ini dx = x > 2 — x'j , dy = )- 2 — , dT = 2(dy - dx), dS = 2 dy; 

int d = 2 dy — dx; 

setPixel(x, y); 

wliile (x < x 2 ) { 
x++; 

if (d < 0) 
d = d + dS ; 
else { 
y++; 

d = d + dT; 

} 

setPixel(x, v); 

} 

Here we first initialize decision variable d and set pixel P\ . During each iteration of the while loop, we 
increment x to the next horizontal position, then use the current value of d to select the bottom or top 
(increment v) pixel and update d, and at the end set the chosen pixel. 

As for lines that have other m values we can make use of the fact that they can be mirrored either 
horizontally, vertically, or diagonally into tins 0° to 45° angle range. For example, a line from (x' ] , y \ ) to 
(•* 2 <> 2 ) with — 1 <»i<0 has a horizontally mirrored counterpart from (x' ] , ) to (x' 2 , — 14 ) with 

0 < m < 1. We can simply use the algorithm to scan-convert this counterpart, but negate the y coordinate 
at the end of each iteration to set the right pixel for the line. For a line whose slope is in the 45 to 90° 
range, we can obtain its mirrored counterpart by exchanging the x and y coordinates of its endpoints. We 
can then scan-convert this counterpart but we must exchange x and y in the call to setPixel. 


3.3 SCAN-CONVERTING A CIRCLE 

A circle is a symmetrical figure. Any circle-generating algorithm can take advantage of the circle’s 
symmetry to plot eight points for each value that the algorithm calculates. Eight-way symmetry is used by 
reflecting each calculated point around each 45° axis. For example, if point 1 in Fig. 3-4 were calculated 
with a circle algorithm, seven more points could be found by reflection. The reflection is accomplished by 
reversing the x, y coordinates as in point 2, reversing the x, y coordinates and reflecting about the y axis as 
in point 3, reflecting about the v axis as in point 4, switching the signs of x and y as in point 5, reversing the 
x, y coordinates, reflecting about the y axis and reflecting about the x axis as in point 6, reversing the x, v 
coordinates and reflecting about the y axis as in point 7, and reflecting about the x axis as in point 8. 

To summarize: 


Pi =(x,y) P 5 — (—x, —y) 

P 2 = (y, x) P 6 - (-y, -x) 

Pj - (~y, *) Pi — (y, -x) 

P 4 = ( x, y) P s - (x, -y) 
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Fig. 3-4 Eight-way symmetry of a circle. 

Defining a Circle 

There are two standard methods of mathematically defining a circle centered at the origin. The first 
method defines a circle with the second-order polynomial equation (see Fig. 3-5) 

y 2 = r 2 — x i 

where x = the x coordinate 
y = the y coordinate 
r — the circle radius 

With tins method, each x coordinate in the sector, from 9 0 to 45°, is found by stepping x from 0 to 
rj s/2, and each y coordinate is found by evaluating %/ r 2 — x 2 for each step of x. Tliis is a very inefficient 
method, however, because for each point both x and r must be squared and subtracted from each other; then 
the square root of the result must be found. 


) 




Fig. 3-5 Circle defined with a second-degree polynomial equation. 
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The second method of defining a circle makes use of trigonometric functios (see Fig. 3-6): 

x = r cos 9 y — r sin 9 


where 9 — current angle 
r — circle radius 
x = x coordinate 
y = v coordinate 

By this method, 0 is stepped from 9 to n/4, and each value of .y and y is calculated. However, 
computation of the values of sin 9 and cos 0 is even more time-consuming than the calculations required by 
the first method. 



Brcscnham's Circle Algorithm 

If a circle is to be plotted efficiently, the use of trigonometric and power functions must be avoided. 
And, as with the generation of a straight line, it is also desirable to perform the calculations necessary to 
find the scan-converted points with only integer addition, subtraction, and multiplication by powers of 2. 
Bresenham ’s circle algorithm allows these goals to be met. 

Scan-converting a circle using Bresenham’s algorithm works as follows. If the eight-way symmetry of 
a circle is used to generate a circle, points will only have to be generated through a 45° angle. And, if points 
are generated from 90° to 45 A moves will be made only in the +x and — y directions (see Fig. 3-7). 

The best approximation of the true circle will be described by those pixels in the raster that fall the 
least distance from the true circle. Examine Fig. 3-8. Notice that, if points are generated from 90 and 45°, 
each new point closest to the true circle can be found by taking either of two actions: (1) move in the x 
direction one unit or (2) move in the x direction one unit and move in the negative y direction one unit. 
Therefore, a method of selecting between these two choices is all that is necessary to find the points closest 
to the true circle. 

Assume that (.v ; ,y ; ) are the coordinates of the last scan-converted pixel upon entering step i (see Fig. 
3-8). Let the distance from the origin to pixel T squared minus the distance to the true circle 
squared — D(T). Then let the distance from the origin to pixel S squared minus the distance to the true 
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Fig. 3-7 Circle scan-converted with Bresenham's algorithm. 

circle squared = £>(S). As the coordinates of T are (x f + 1 , >,) and those of S are (x, + 1 , v t - - I), the 
following expressions can be developed: 

D( T) = (x, + 1 f + yf - r 2 D( S) = (x, + 1 ) 2 + ( y, - 1 ) 2 - r 2 

Tliis function D provides a relative measurement of the distance from the center of a pixel to the true 
circle. Since D(T) will always be positive (T is outside the true circle) and D(S) will always be negative (S 
is inside the true circle), a decision variable d t may be defined as follows: 

dj = D(T) + Z)(S) 

Therefore 

dj — 2(Xj 4- 1) 2 +yf + ( V'; — 1)^ — 2r 



Fig. 3-8 Choosing pixels in Bresenham’s circle algorithm. 
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When dj < 0, we have |D(T)| < D(S)| and pixel T is chosen. When d t > 0, we have |D(T)| > |D(S)| 
and pixel S is selected. We can also write the decision variable d j+i for the next step: 

d-,+i = 2(x, +1 + 1)" +yf + 1 + CVi+i - 1 ) 2 - 2r 2 

Hence 

di + 1 - 4 = 2(*/+i + l) 2 +yf + 1 + CV/+1 - I) 2 - 2(x, + l) 2 -y 2 - (V; - l) 2 
Since x, +1 = x,- + 1, we have 

d i+ 1 = d, + 4x, + 2(yf +i -y 2 ) - 2(.y, +1 ->• ) + 6 
If T is the chosen pixel (meaning that d, < 0) then y i+1 = v, and so 

dj^ i = dj + 4 x, + 6 

On the other hand, if S is the chosen pixel (meaning that d t > 0) then y i+1 =y ( — 1 and so 

dj + 1 = dj + 4(x, — v',) + 1 0 

Hence we have 

, _ | dj + 4x, + 6 if dj < 0 

i+1 “ | dj + 4(x f - y,) +10 if d ( > 0 

Finally, we set (0. r) to be the starting pixel coordinates and compute the base case value c/, for this 
recursive formula from the original definition of <-/, ; 

dj = 2(0 + 1 ) 2 + r 2 + (r - l) 2 - 2r 2 = 3-2 r 

We can now summarize the algorithm for generating all the pixel coordinates in the 90° to 45° octant 
that are needed when scan-converting a circle of radius r: 

int x = 0, y = r, d = 3 — 2 r, 
while (x <= y) { 
setPixel(x, _y); 
if (d < 0) 
d = d + 4x + 6: 
else { 

d — d + 4(x — _v) + 10: 

y — ; 

} 

x++; 

} 

Note that during each iteration of the while loop we first set a pixel whose position has already been 
determined, starting with (0, r). We then test the current value of decision variable d in order to update d 
and determine the proper y coordinate of the next pixel. Finally we increment x. 


Midpoint Circle Algorithm 

We present another incremental circle algorithm that is very similar to B resen ham's approach. It is 
based on the following function for testing the spatial relationship between an arbitrary point (x, y) and a 
circle of radius r centered at the origin: 

< 0 (x, y) inside the circle 

= 0 (x, y) on the circle 

> 0 (x, y) outside the circle 


f(x,y) = x 2 +_y 2 — r 2 
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Now consider the coordinates of the point halfway between pixel T and pixel S in Fig. 3-8: 
(x ; + 1 , y,- — j). This is called the midpoint and we use it to define a decision parameter: 

Pi =/(*, + I . V/ - 1) = (*, + 1 ) 2 + Cv, - \f ~ f 2 

If Pj is negative, the midpoint is inside the circle, and we choose pixel T. On the other hand, if /?, is positive 
(or equal to zero), the midpoint is outside the circle (or on the circle), and we choose pixel S. Similarly, the 
decision parameter for the next step is 

Pi + 1 = ( x i+i + I) 2 + Cv, + i - 5) 2 - r 2 

Since x /+1 — .r, + 1 , we have 

Pi\ t — Pi — [(*, + •) + if — (*f + l) 2 + 0/+i — |) 2 — (v, — j) 2 

Hence 


Pi+i = Pi + 2(x, + 1 ) + 1 + (v 2 +1 - vf) - (v /+1 - yd 

If pixel T is chosen (meaning/?, < 0), we havey I+I = y,. On the other hand, if pixel S is chosen (meaning 
Pi > 0), we have y /+1 = — 1. Thus 

_\Pi + 2(x,- + 1) + 1 if Pi < 0 

Pi+l _ \p, + 2( Xj + 1) + 1 - 2(y, - 1) if Pi > 0 


We can continue to simplify this in terms of (x,,jy) and get 


| Pi + 2x, + 3 
1 P; + 2(x, — >’,) + 5 


Or we can write it in tenns of (x l+1 ,y, +1 ) and have 


Pi+l 


= \ Pi 
\Pi 


+ 2x 

+ 2(x, +1 - }’i + 1 ) + 1 


i+i 


if Pi < 0 
>f Pi > 0 


if Pj < 0 
if Pi > 0 


Finally, we compute the initial value for the decision parameter using the original definition of p t and (0, /*): 

/?, = (0+l) 2 +(r-i) 2 -r 2 =f-r 


One can see that this is not really integer computation. However, when r is an integer we can simply 
set /;, = !— r. Tire error of being £ less than the precise value does not prevent /;, from getting the 
appropriate sign. It does not affect the rest of the scan-conversion process either, because the decision 
variable is only updated with integer increments in subsequent steps. 

The following is a description of this midpoint circle algorithm that generates the pixel coordinates in 
the 90° to 45° octant: 

int x = 0, y = r, p = 1 — r; 

while (x <= y) { 
setPixel(x, y); 
if (p < 0) ' 
p =p + 2x + 3; 
else { 

P — P + 2(x — y) + 5; 

y — ; 


x++; 

} 
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Arbitrarily Centered Circles 

In the above discussion of the circle algorithms we have assumed that a circle is centered at the origin. 
To scan-convert a circle centered at (x c ,y c ). we can simply replace the setPixel(x, v) statement in the 
algorithm description with setPixeK.r + x c , v +y c ). The reason for this to work is that a circle centered at 
(x c , v'c) can be viewed as a circle centered at the origin that is moved by x c and y c in the x and y direction, 
respectively. We can achieve the effect of scan-converting this arbitrarily centered circle by relocating scan- 
converted pixels in the same way as moving the circle’s center from the origin. 


3.4 SCAN-CONVERTING AN ELLIPSE 

The ellipse, like the circle, shows symmetry. In the case of an ellipse, however, symmetry is four- 
rather than eight-way. There are two methods of mathematically defining a ellipse. 


Polynomial Method of Defining an Ellipse 

Tire polynomial method of defining an ellipse (Fig. 3-9) is given by the expression 

(x-h) 2 (y-k) 2 

a 2 b 2 


where (h,k) — ellipse center 

a = length of major axis 
b — length of minor axis 

When the polynomial method is used to define an ellipse, the value of x is incremented from h to a. For 
each step ofx, each value of y is found by evaluating the expression 


v = b 


(x-h) 2 


+ k 


This method is very inefficient, however, because the squares of a and (x-h) must be found; then floating- 
point division of (x — hf by cr and floating-point multiplication of the square root of [1 — (x — hf /a 2 ] by 
b must be performed (see Prob. 3.20). 

Routines have been found that will scan-convert general polynomial equations, including the 
ellipse. However, these routines are logic intensive and thus are very slow methods for scan-converting 
ellipses. 



Fig. 3-9 Polynomial description of an ellipse. 
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Trigonometric Method of Defining an Ellipse 

A second method of defining an ellipse makes use of trigonometric relationships (see Fig. 3-10). The 
following equations define an ellipse trigonometrically: 

x — a cos(0) + h and y — b sin(fi) + k 

where ft, y) = the current coordinates 
a — length of major axis 
b = length of minor axis 
6 — current angle 
(, h , k) = ellipse center 

For generation of an ellipse using the trigonometric method, the value of 0 is varied from 0 to n/2 
radians (rad). Tire remaining points are found by symmetry. While this method is also inefficient and thus 
generally too slow for interactive applications, a lookup table containing the values for sin(0) and cos (0) 
with 6 ranging from 0 to 7r/2rad can be used. This method would have been considered unacceptable at 
one time because of the relatively high cost of the computer memory used to store the values of 9. 
However, because the cost of computer memory has plummeted in recent years, tins method is now quite 
acceptable. 



Ellipse Axis Rotation 

Since the ellipse shows four-way symmetry, it can easily be rotated 90°. The new equation is found by 
trading a and b, the values which describe the major and minor axes. When the polynomial method is used, 
the equations used to describe the ellipse become 

(x - hf , (y - k) 2 , 

b 2 + a 2 


where (/?, k) = ellipse center 

a = length of major axis 
h — length of minor axis 
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When the trigonometric method is used, the equations used to describe the ellipse become 

x— bcos(9) + h and y — a sin(0) + A 

where (x, y) — current coordinates 
a — length of major axis 
b = length of minor axis 
6 = current angle 
(h, k) = ellipse center 

Assume that you would like to rotate the ellipse through an angle other than 90 r . It can be seen from 
Fig. 3-11 that rotation of the ellipse may be accomplished by rotating the a- and y axis a degrees. When this 
is done, the equations describing the x, y coordinates of each scan-converted point become 

x = a cos(fl) - b sin(0 + a) + h y = b sin(fl) + a cos (0 + a) + A- 



Fig. 3-1 1 Rotation of an ellipse. 


Midpoint Ellipse Algorithm 


This is an incremental method for scan-converting an ellipse that is centered at the origin in standard 
position (i.e., with its major and minor axes parallel to the coordinate system axes). It works in a way that is 
very similar to the midpoint circle algorithm. However, because of the four-way symmetry property we 
need to consider the entire elliptical curve in the first quadrant (see Fig. 3-12). 

Let’s first rewrite the ellipse equation and define function / that can be used to decide if the midpoint 
between two candidate pixels is inside or outside the ellipse: 


f(x,y) = b 2 x 2 + a 2 y 2 - crh 2 


< 0 
= 0 
> 0 


(x, y) inside the ellipse 
(x, y) on the ellipse 
(x, y) outside the ellipse 
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Now divide the elliptical curve from (0,6) to (a, 0) into two parts at point Q where the slope of the 
curve is - 1 . Recall that the slope of a curve defined by f(x, y) = 0 is dy/dx = -fa /fa, where fa and fa are 
partial derivatives of/(x,v) with respect to x and y, respectively. We have fa = 2 fax, fa = 2cry. and 
dy/dx = —2b 2 x/2cry, Tliis shows that the slope of the curve changes monotonically from one side of Q to 
the other. Hence we can monitor the slope value during the scan-conversion process to detect Q. 

Our starting point is (0, b). Suppose that the coordinates of the last scan-converted pixel upon entering 
step i are (x,, y f ). We are to select either T(„v, 1 , yj or S(.v, + 1 , y, — 1 ) to be the next pixel. The midpoint 

of the vertical line connecting T and S is used to define the following decision parameter: 

Pi = /(*/ + 1 ,yt-j) = b 2 (x t + 1 ) 2 + a 2 (y, - \) 2 - erb 2 

If Pi < 0. the midpoint is inside the curve, and we choose pixel T. On the other hand, if p, > 0, the 
midpoint is outside or on the curve, and we choose pixel S. Similarly, we can write the decision parameter 
for the next step: 

Pi + 1 “/(**+ 1 + 1.T/+1 - 5) = b 2 (x i+l + I) 2 + fl 2 (.v,+i “ ~ a 2 b 2 

Since x i+l = x t + 1, we have 

Pi + 1 -Pi = + *) 2 - xf+i] + a 2 [(.v J+1 - 1) 2 - (y, - 1) 2 ] 

Hence 

Ph i =Pt + 26 2 x, +1 +b 2 + a 2 [(y i+l - \f - (y, - ^) 2 ] 

If T is the chosen pixel (meaning p, < 0), we have y, +1 = _y,-. On the other hand, if pixel S is chosen 
(meaning p, > 0), we have y i+1 =y f — 1. Tlius we can express p i+1 in tenns of p, and (x i+1 ,.y ;+1 ): 

_ | Pi + 2 b 2 x m + b 2 if pj < 0 

,} ‘ 1 1 I p, + 2 b 2 x i+l + b 2 - 2 a 2 y i+1 if p, > 0 

Tlie initial value for this recursive expression can be obtained by evaluating the original definition of p, 
with (0,6): 


Pi = 6 2 + a 2 (b — ^) 2 — a 2 b 2 = b 2 — erb + a 2 / 4 
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We now move on to derive a similar formula for part 2 of the curve. Suppose pixel (x,, y.) has just been 
scan-converted upon entering step j. The next pixel is either U(x ; ,y — 1) or V(x + 1 , V, — 1). The 
midpoint of the horizontal line connecting U and V is used to define the decision parameter 

<7/ =f(Xj + \ , yj - 1) = b 2 (xj + i) 2 + a 2 (y t - 1 ) 2 - a 2 b 2 

If th < 0, the midpoint is inside the curve and V is chosen. On the other hand, if > 0, it is outside or on 
the curve and U is chosen. We also have 

<7/+i =/(*,+ 1 +5..V ;+ i - 1) = b 2 (x j+l + i) 2 + a 2 (y J+l - l) 2 - a 2 if 

Since v- +1 — y>j — 1, we have 

<?/+ 1 _ <7, = ^ 2 [i x /+i + ?) 2 — ( x j + j) 2 ] + « 2 [(.V) + , — 1 ) 2 — .v 2 + ,J 

Hence 

<7 / +l = q, + b 2 [(x j+ j + £) 2 - (xj + |) 2 J - 2 a 2 y j+i + a 2 

If V is the chosen pixel (meaning q t < 0), we have x /+1 = x,- + 1. On the other hand, if U is chosen 
(meaning q t > 0), we have x ;+1 =x ; . Titus we can express c/ f+l in terms of q : and (x, +1 ,_v ;+] ): 

_\<lj + 2b 2 x H i - 2 a 2 y j+1 + cr if < 0 

C,!+l - 2a 2 y J+ \ + a 2 if qj > 0 

The initial value for this recursive expression is computed using the original definition of q t and the 
coordinates (x k ,y k ) of the last pixel chosen for part 1 of the curve: 

<i\ =f(x k + 2 , V* - 1) = b 2 (x k + i) 2 + a 2 (y k - 1 ) 2 - a 2 b 2 

We can now' put together a pseudo-code description of the midpoint algorithm for scan-converting the 
elliptical curve in the first quadrant. There are a couple of technical details that are worth noting. First, we 
keep track of the slope of the curve by evaluating the partial derivatives fx and fy at each chosen pixel 
position. This means that /x = 2/rx, and fy — 2ci 2 y i for position (x, , y,)- Since we have x, +1 = x, + 1 and 
y l+ 1 = y t or y, - 1 for part 1, and x J+l = x, or xj + 1 andy /+1 = y-j - 1 for part 2, the partial derivatives can 

be updated incrementally using 2 b 2 and/or —hr. For example, if x /+1 =x,-+ 1 and y /+1 =y t — 1, the 

partial derivatives for position (x, +1 ,y, +1 ) are 26 2 x J+1 — 2 b 2 x t + 2 b 2 and 2cfy l+l — 2 a 2 y t — 2a 2 . Second, 
since 2/rx ;+l and 2 a 2 y l+i appear in the recursive expression for the decision parameters, we can use them 
to efficiently compute p i+l as well as q . + 1 : 

int x = 0, y = b; /* starting point */ 
int aa — a*a, bb — b*b, aa2 — aa* 2, bb2 — bb* 2; 
int fa — 0 ,fy — aa2*b; /* initial partial derivatives */ 
int p — bb — aa*b + 0.25 *««; /* compute and round off p l */ 

while (fx <fv) { /* |slope| < 1 */ 
setPixel(x, yj; 
x++; 

fx = fa + bb2\ 
if (p < 0) 
p = p +Jx + bb; 
else { 
y — ; 

fy — fy — aal; 
p—p+fx + bb -fy; 

) 

} 

setPixel(x, y); /* set pixel at (x k .v k ) */ 
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p = bb(x + 0.5)(jt + 0.5) + aa(y — 1 ){_v — 1) — aa*bb; /* set q l */ 
while (y > 0) { 
y — ; 

fy —fy - c/c/2; 

if(p>= 0) 

p-p —Jy + aa > 
else { 

*++; 

ft = fit + bb2: 
p=p +fit -Jy + aa: 

} 

setPixel(jc, y); 

} 


3.5 SCAN-CONVERTING ARCS AND SECTORS 
Arcs 

An arc [Fig. 3-13(cz)] may be generated by using either the polynomial or the trigonometric method. 
When the trigonometric method is used, the starting value is set equal to 0, and the ending value is set 
equal to B 2 [see Figs. 3-13(cz) and 3- 13(b)], The rest of the steps are similar to those used when scan- 
converting a circle, except that symmetry is not used. 




Kig. 3-13 


When the polynomial method is used, t he value of x is varied from .v t to x 2 and the values of y are 
found by evaluating the expressio v/r 2 — x 2 (Fig. 3-14). 

From the graphics programmer’s point of view, arcs would appear to be nothing more than portions of 
circles. However, problems occur if algorithms such as Bresenham’s circle algorithm are used in drawing 
an arc. In the case of Bresenham’s algorithm, the endpoints of an arc must be specified in terms of the x, y 
coordinates. The general formulation becomes inefficient when endpoints must be found (see Fig. 3-15). 
This occurs because the endpoints for each 45° increment of the arc must be found. Each of the eight 
points found by reflection must be tested to see if the point is between the specified endpoints of the arc. As 
a result, a routine to draw an arc based on Bresenham’s algorithm must take the time to calculate and test 
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Fig. 3-14 


every point on the circle’s perimeter. There is always the danger that the endpoints will be missed when a 
method like this is used. If the endpoints are missed, the routine can become caught in an infinite loop. 



Sectors 

A sector is scan-converted by using any of the methods of scan-converting an arc and then scan- 
converting two lines from the center of the arc to the endpoints of the arc. 

For example, assume that a sector whose center is at point (/?, A’) is to be scan-converted. First, scan- 
convert an arc from to 0 2 . Next, a line would be scan-converted from (h, k) to (rcos(d, ) + /?. 
r sin(^! ) + A'). A second line would be scan-converted from (h,k) to (rcos(ik) + A, rsin(0 2 ) + A). 


3.6 SCAN-CONVERTING A RECTANGLE 

A rectangle whose sides are parallel to the coordinate axes may be constructed if the locations of two 
vertices are known [see Fig. 3- 16(a)], The remaining comer points are then derived [see Fig. 3- 1 6(A)], 
Once the vertices are known, the four sets of coordinates are sent to the line routine and the rectangle is 
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Fig. 3-16 


scan-converted. In the case of the rectangle shown in Figs. 3-16(«) and 3-16(/r), lines would be drawn as 
follows: line (xj.j?]) to (Xi,y 2 ); line (aj , y 2 ) to (x 2 , y 2 ): line (x 2 , v 2 ) to (x 2 , Vj); and line (x 2 ,y 1 ) to (X[, V]). 


3.7 REGION FILLING 

Region filling is the process of “coloring in” a definite image area or region. Regions may be defined 
at the pixel or geometric level. At the pixel level, we describe a region either in tenns of the bounding 
pixels that outline it or as the totality of pixels that comprise it (see Fig, 3-1 7). In the first case the region is 
called boundary-defined and the collection of algorithms used for filling such a region are collectively 
called boundary-fill algorithms. The other type of region is called an interior-defined region and the 
accompanying algorithms are called flood-fill algorithms. At the geometric level a region is defined or 
enclosed by such abstract contouring elements as connected lines and curves. For example, a polygonal 
region, or a filled polygon, is defined by a closed polyline, which is a polyline (i.e., a series of sequentially 
connected lines) that has the end of the last line connected to the beginning of the first line. 


Boundary-defined region 



(a) 


Interior -defined region 

o o o o o o 

o o o o o o o 

ooooooooo 
o o o o o o o 

ooooooooo 
ooo ooooo 
o o o o o o o 

( b ) 


Fig. 3-17 


4-Connected vs. 8-Connectcd 

An interesting point here is that, while a geometrically defined contour clearly separates the interior 
of a region from the exterior, ambiguity may arise when an outline consists of discrete pixels in the 
image space. There are two ways in which pixels are considered connected to each other to form a 
“continuous” boundary. One method is called 4-connected, where a pixel may have up to four neighbors 
[see Fig. 3-1 8(a)]; the other is called 8-connected, where a pixel may have up to eight neighbors [see 
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(«) 


(b) (c) 

Fig. 3-18 4-connected vs. 8-connected pixels. 


Fig. 3-1 8(Z>)]. Using the 4-connected approach, the pixels in Fig. 3- 18(c) do not define a region since 
several pixels such as A and B are not connected. However using the 8-connected definition we identify a 
triangular region. 

We can further apply the concept of connected pixels to decide if a region is connected to another 
region. For example, using the 8-connected approach, we do not have an enclosed region in Fig. 3- 1 8(c) 
since “interior” pixel C is connected to “exterior” pixel D. On the other hand, if we use the 4-connected 
definition we have a triangular region since no interior pixel is connected to the outside. 

Note that it is not a mere coincidence that the figure in Fig. 3-1 8(c) is a boundary -defined region when 
we use the 8-connected definition for the boundary pixels and the 4-counected definition for the interior 
pixels. In fact, using the same definition for both boundary and interior pixels would simply result in 
contradiction. For example, if we use the 8-connected approach we would have pixel A connected to pixel 
B (continuous boundary) and at the same time pixel C connected to pixel D (discontinuous boundary). On 
the other hand, if we use the 4-connectd definition we would have pixel A disconnected from pixel B 
(discontinuous boundary) and at the same time pixel C disconnected from pixel D (continuous boundary). 


A Boundary-till Algorithm 

This is a recursive algorithm that begins with a starting pixel, called a seed, inside the region. The 
algoritlnn checks to see if this pixel is a boundary pixel or has already been filled. If the answer is no, it fills 
the pixel and makes a recursive call to itself using each and every neighboring pixel as a new seed. If the 
answer is yes, the algorithm simply returns to its caller. 

This algorithm works elegantly on an arbitrarily shaped region by chasing and filling all non-boundary 
pixels that are connected to the seed, either directly or indirectly through a chain of neighboring relations. 
However, a straightforward implementation can take time and memory to execute due to the potentially 
high number of recursive calls, especially when the size of the region is relatively large. 

Variations can be made to limit the number of recursive calls by structuring the order in which 
neighboring pixels are processed. For example, we can first fill pixels to the left and right of the seed on the 
same scan line until boundary pixels are hit (something that can be done using a loop control structure). We 
then inspect each pixel above and below the line just drawn (which can also be done with a loop) to see if it 
can be used as a new seed for the next horizontal line to fill. This way the number of recursive calls at any 
particular time is merely N when the current line is N scan lines away from the initial seed. 


A Flood-fill Algorithm 

This algorithm also begins with a seed (starting pixel) inside the region. It checks to see if the pixel has 
the region’s original color. If the answer is yes, it fills the pixel with a new color and uses each of the pixel’s 
neighbors as a new seed in a recursive call. If the answer is no, it returns to the caller. 

This method shares great similarities in its operating principle with the boundary-fill algoritlnn. It is 
particularly useful when the region to be filled has no uniformly colored boundary. On the other hand, a 
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region that has a well-defined boundary but is itself multiply colored would be better handled by the 
boundary-fill method. 

The execution efficiency of this flood-fill algorithm can be improved in basically the same way as 
discussed above regarding the boundary-fill algorithm. 


A Scan-line Algorithm 

In contrast to the boundary-fill and flood-fill algorithms that fill regions defined at the pixel level in the 
image space, this algorithm handles polygonal regions that are geometrically defined by the coordinates of 
their vertices (along with the edges that connect the vertices). Although such regions can be filled by first 
scan-converting the edges to get the boundary pixels and then applying a boundary-fill algorithm to finish 
the job, the following is a much more efficient approach that makes use of the information regarding edges 
that are available during scan conversion to facilitate the filling of interior pixels. 

We represent a polygonal region in terms of a sequence of vertices V t , V 2 , that are connected 

by edges £j , E 2 , E } , . . . , (see Fig. 3-19). We assume that each vertex V t has already been scan-converted to 
integer coordinates (.r,,y,). 



The algorithm begins with the first scan line the polygon occupies and proceeds line by line towards 
the last scan line. For each scan line it finds all intersection points between the current scan line and the 
edges. For example, scan line y intersects edges £,, E 7 , E b , and E 4 at points a, b, c, and b, respectively. The 
intersection points are sorted according to their .t coordinates and grouped into pairs such as (a, b) and 
(c, cl). A line is drawn from the first point to the second point in each pair. 

Horizontal edges are ignored since the pixels that belong to them are automatically filled during scan 
conversion. For example, edge E 5 is drawn when the corresponding scan line is processed. The two 
intersection points between the scan line and edges E b and E 4 are connected by a line that equals 
exactly E s . 

Now we take a more careful look at cases when a scan line intersects a vertex. If the vertex is a local 
minimum or maximum such as V l and V 7 , no special treatment is necessary. The two edges that join at the 
vertex will each yield an intersection point with the scan line. These two intersection points can be treated 
just like any other intersection points to produce pairs of points for the filling operation. As for vertices V 5 
and V 6 , they are simply local minimums, each with one joining edge that produces a single intersection 
point. On the other hand, if a scan line intersects a vertex (e.g., V 4 ) that is joined by two monotonically 
increasing or decreasing edges, getting two intersection points at that vertex location would lead to 
incorrect results (e.g., a total of three intersection points on the scan line that intersects V 4 ). The solution to 
this problem is to record only one intersection point at the vertex. 

In order to support an efficient implementation of this scan line algorithm we create a data structure 
called an edge list (see Table 3-1). Each non-horizontal edge occupies one row/record. Information stored 
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Table 3-1 An edge list. 


Edge 

A’nun 

3 max 

x coordinate of 
vertex with y = y„ lm 

l/m 

Ex 

Ti 

y 2 — 1 

*1 

I/m, 

e 7 

>i 

y? 

*1 

l/m 7 


ys 

V4 - 1 

*5 

l/w 4 

E 6 

>’6 

y? 

x 6 

l/m 6 

e 2 

y% 

y-i 

X 2 

1 /»h 

Ei 

y* 


x 4 

\/m 3 


in each row includes the y coordinate of the edge’s two endpoints, with v min being the smaller value and 
3max the larger value (may be decreased by 1 for reasons to be discussed below), the x coordinate of the 
endpoint whose y coordinate is y min , and the inverse of the edge’s slope m. The rows are sorted according to 
y mu) . Going back to our example in Fig. 3-19, since edges E ] and E n both originate from the lowest vertex 
f'i at (x^ Ti), they appear on top of the edge list, with m x and m 1 being their slope value, respectively. 

Edges in the edge list become active when the y coordinate of the current scan line matches their y mlll 
value. Only active edges are involved in the calculation of intersection points. The first intersection point 
between an active edge and a scan line is always the lower endpoint of the edge, whose coordinates are 
already stored in the edge’s record. For example, when the algorithm begins at the first scan line, edges £, 
and E 1 become active. They intersect the scan line at (x, .y,). 

Additional intersection points between an edge and successive scan lines can be calculated 
incrementally. If the edge intersects the current scan line at (x, y), it intersects the next scan line at 
(x + 1 /m,y + 1 ). For example, edge E n intersects scan line y at point b, and so the next intersection point 
on scan line y + 1 can be calculated by Ax = l/m 7 since Ay = 1 . This new x value can simply be kept in 
the x field of the edge’s record. 

An edge is deactivated or may even be removed from the edge list once the scan line whose y 
coordinate matches its y nmx value lias been processed, since all subsequent scan lines stay clear from it. The 
need that was mentioned early to give special treatment to a vertex where two monotonicaUy increasing or 
decreasing edges meet is elegantly addressed by subtracting one from the y niax value of the lower edge. 
This means that the lower edge is deactivated one line before the scan line that intersects the vertex. Thus 
only the upper edge produces an intersection point with that scan line (see V 2 in Fig. 3-19). This explains 
why the initial y max value for edges E } and E 4 has been decreased by one. 


3.8 SCAN-CONVERTING A CHARACTER 

Characters such as letters and digits are the building blocks of an image’s textual contents. They can be 
presented in a variety of styles and sizes. The overall design style of a set of characters is referred to as its 
typeface or font. Commonly used fonts include Arial, Century Schoolbook, Courier, and Times New 
Roman. In addition, fonts can vary in appearance: bold, italic, and bold and italic. Character size is 
typically measured in height in inches, points (approximately ^ inch), and picas (12 points). 


Bitmap Font 

There are two basic approaches to character representation. The first is called a raster or bitmap font, 
where each character is represented by the on pixels in a bilevel pixel grid pattern called a bitmap (see Fig. 
3-20). This approach is simple and effective since characters are defined in already-scan-converted form. 
Putting a character into an image basically entails a direct mapping or copying of its bitmap to a specific 
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location in the image space. On the other hand, a separate font consisting of scores of bitmaps for a set of 
characters is often needed for each combination of style, appearance, and size. 

Although one might generate variations in appearance and size from one font, the overall results tends 
to be less than satisfactory. The example in Fig. 3-21 shows that we may overlay the bitmap in Fig. 3-20 
onto itself with a horizontal offset of one pixel to produce (a) bold, and shift rows of pixels in Fig. 3-20 to 
produce (b) italic. 



(a) Bold (b) Italic 

Fig. 3-21 Generating variations in appearance. 


Furthermore, the size of a bitmap font is dependent on image resolution. For example, a font using 
bitmaps that are 12 pixels high produces 12-point characters in an image with 72 pixels per inch. However, 
the same font will result in 9-point characters in an image with 96 pixels per inch, since 12 pixels now- 
measure 0.125 inch, which is about 9 points. To get 12-point characters in the second image we would need 
a font with bitmaps that are 16 pixels high. 


Outline Font 

The second character representation method is called a vector or outline font, where graphical 
primitives such as lines and arcs are used to define the outline of each character (see Fig. 3-22). Although 
an outline definition tends to be less compact than a bitmap definition and requires relatively time- 
consuming scan-conversion operations, it can be used to produce characters of varying size, appearance, 
and even orientation. For example, the outline definition in Fig. 3-22 can be resized through a scaling 
transformation, made into italic through a shearing transformation, and turned around with respect to a 
reference point through a rotation transformation (see Chap. 4). 

These transformed primitives can be scan-converted directly into characters in the fonn of filled 
regions in the target image area. Or they can be used to create the equivalent bitmaps that are then used to 
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produce characters. This alternative is particularly effective in limiting scan-conversion time when the 
characters have repetitive occurrences in the image. 


3.9 ANTI-ALIASING 

Scan conversion is essentially a systematic approach to mapping objects that are defined in continuous 
space to their discrete approximation. The various forms of distortion that result from this operation are 
collectively referred to as the aliasing effects of scan conversion. 


Staircase 

A common example of aliasing effects is the staircase or jagged appearance we see when scan- 
converting a primitive such as a line or a circle. We also see the stair steps or “jaggies” along the border of 
a filled region. 


Unequal Brightness 

Another artifact that is less noticeable is the unequal brightness of lines of different orientation. A 
slanted line appears dimmer than a horizontal or vertical line, although all are presented at the same 
intensity level. The reason for this problem can be explained using Fig. 3-23, where the pixels on the 
horizontal line are placed one unit apart, whereas those on the diagonal line are approximately 1.414 units 
apart. This difference in density produces the perceived difference in brightness. 



Fig. 3-23 


X 
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The Picket Fence Problem 

The picket fence problem occurs when an object is not aligned with, or does not fit into, the pixel grid 
properly. Figure 3-24(o) shows a picket fence where the distance between two adjacent pickets is not a 
multiple of the unit distance between pixels. Scan-converting it normally into the image space will result in 
uneven distances between pickets since the endpoints of the pickets will have to be snapped to pixel 
coordinates [see Fig. 3-24(6)]. This is sometimes called global aliasing, as the overall length of the picket 
fence is approximately correct. On the other hand, an attempt to maintain equal spacing will greatly distort 
the overall length of the fence [see Fig, 3-24(c)]. This is sometimes called local aliasing, as the distances 
between pickets are kept close to their true distances. 


(n) 


(b) 


(c) 


Fig. 3-24 The picket fence problem. 


Another example of such a problem arises with the outline font. Suppose we want to scan-convert the 
uppercase character “E” in Fig. 3-25 {a) from its outline description to a bitmap consisting of pixels inside 
the region defined by the outline. The result in Fig. 3-25(6) exhibits both asymmetry (the upper arm of the 
character is twice as thick as the other parts) and dropout (the middle arm is absent). A slight adjustment 
and/or realignment of the outline can lead to a reasonable outcome [see Fig. 3-25(c)]. 



(a) (b) (c) 

Fig. 3-25 Scan -converting an outline font. 


Anti-aliasing 

Most aliasing artifacts, when appear in a static image at a moderate resolution, are often tolerable, and 
in many cases, negligible. However, they can have a significant impact on our viewing experience when 
left untreated in a series of images that animate moving objects. For example, a line being rotated around 
one of its endpoints becomes a rotating escalator with length-altering steps. A moving object with small 
parts or surface details may have some of those features intermittently change shape or even disappear. 
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Although increasing image resolution is a straightforward way to decrease the size of many aliasing 
artifacts and alleviate their negative impact, we pay a heavy price in terms of system resource (going from 
W x H to 2 W x 2 H means quadrupling the number of pixels) and the results are not always satisfactory. 
On the other hand, there are techniques that can greatly reduce aliasing artifacts and improve the 
appearance of images without increasing their resolution. These techniques are collectively referred to as 
anti-aliasing techniques. 

Some anti-aliasing techniques are designed to treat a particular type of artifact. For instance, an outline 
font can be associated with a set of rules or hints to guide the adjustment and realignment that is necessary 
for its conversion into bitmaps of relatively low resolution. An example of such approach is called the 
TrueType font. 


Pre-filtering and Post-filtering 

Pre-filtering and post-filtering are two types of general-purpose anti-aliasing techniques. The concept 
of filtering originates from the field of signal processing, where true intensity values are continuous signals 
that consists of elements of various frequencies. Constant intensity values that correspond to a uniform 
region are at the low end of the frequency range. Intensity values that change abruptly and correspond to a 
sharp edge are at the high end of the spectrum. In order to lessen the jagged appearance of lines and other 
contours in the image space, we seek to smooth out sudden intensity changes, or in signal-processing 
terms, to filter out the high frequency components. A pre-filtering technique works on the true signal in the 
continuous space to derive proper values for individual pixels (filtering before sampling), whereas a post- 
filtering technique takes discrete samples of the continuous signal and uses the samples to compute pixel 
values (sampling before filtering). 


Area Sampling 

Area sampling is a pre-filtering technique in which we superimpose a pixel grid pattern onto the 
continuous object definition. For each pixel area that intersects the object, we calculate the percentage of 
overlap by the object. This percentage determines the proportion of the overall intensity value of the 
corresponding pixel that is due to the object’s contribution. In other words, the higher the percentage of 
overlap, the greater influence the object has on the pixel’s overall intensity value. 

In Fig. 3-26(a) a mathematical line shown in dotted form is represented by a rectangular region that is 
one pixel wide. The percentage of overlap between the rectangle and each intersecting pixel is calculated 
analytically. Assuming that the background is black and the line is white, the percentage values can be used 
directly to set the intensity of the pixels [see Fig. 3-26(A)]. On the other hand, had the background been 
gray (0.5. 0.5, 0.5) and the line green (0, 1,0), each blank pixel in the grid w'ould have had the background 
gray value and each pixel filled with a fractional number / would have been assigned a value of 
[0.5(1 -/), 0.5(1 —f) +/, 0.5(1 — /)]- a proportional blending of the background and object colors. 

Although the resultant discrete approximation of the line in Fig. 3-26 (b) takes on a blurry appearance, 
it no longer exhibits the sudden transition from an on pixel to an off pixel and vice versa, which is what we 











































































(a) 


(b) 


fr) 


Fig. 3-26 Area sampling. 
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would get with an ordinary scan-conversion method [see Fig. 3-26(c)]. This tradeoff is characteristic of an 
anti-aliasing technique based on filtering. 


Super Sampling 

In this approach we subdivide each pixel into subpixels and check the position of each subpixel in 
relation to the object to be scan-converted. The object’s contribution to a pixel’s overall intensity value is 
proportional to the number of subpixels that are inside the area occupied by the object. Figure 3-27 (a) 
shows an example where we have a white object that is bounded by two slanted lines on a black 
background. We subdivide each pixel into nine (3 x 3) subpixels. The scene is mapped to the pixel values 
in Fig. 3-27(6). The pixel at the upper right comer, for instance, is assigned \ since seven of its nine 
subpixels are inside the object area. ITad the object been red (1,0,0) and the background light yellow 
(0.5. 0.5, 0), the pixel would have been assigned (1 x | + 0.5 x | , 0.5 x |, 0), which is (| , 5 , 0). 


0 

2/9 

7/9 

6/9 

1 

8/9 

1 

8/9 

1/9 



(a) 


ft’) 


Fig. 3-27 Super sampling. 


Super sampling is often regarded as a post-filtering technique since discrete samples are first taken and 
then used to calculate pixel values. On the other hand, it can be viewed as an approximation to the area 
sampling method since we are simply using a finite number of values in each pixel area to approximate the 
accurate analytical result. 


Lowpass Filtering 

This is a post-filtering technique in which we reassign each pixel a new value that is a weighted 
average of its original value and the original values of its neighbors. A lowpass filter in the form of a 
(2« + 1) x (2 n + 1) grid, where n > 1, holds the weights for the computation. All weight values in a filter 
should sum to one. An example of a 3 x 3 filter is given in Fig. 3-2 8 (a). 

To compute a new value for a pixel, we align the filter with the pixel grid and center it at the pixel. The 
weighted average is simply the sum of products of each weight in the filter times the corresponding pixel’s 
original value. The filter shown in Fig, 3-28 (a) means that half of each pixel’s original value is retained in 
its new value, while each of the pixel’s four immediate neighbors contributes one eighth of its original 
value to the pixel’s new value. The result of applying this filter to the pixel values in Fig. 3-26(c) is shown 
in Fig. 3-28 (b). 

A lowpass filter with equal weights, sometimes referred to as a box filter, is said to be doing 
neighborhood averaging. On the other hand, a filter with its weight values conforming to a two- 
dimensional Gaussian distribution is called a Gaussian filter. 
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(a) (*) 

Fig. 3-28 Lowpass filtering. 


Pixel Phasing 

Pixel phasing is a hardware-based anti-aliasing technique. Tire graphics system in this case is capable 
of shifting individual pixels from their normal positions in the pixel grid by a fraction (typically | and |) of 
the unit distance between pixels. By moving pixels closer to the true line or other contour, this technique is 
very effective in smoothing out the stair steps without reducing the sharpness of the edges. 


3.10 EXAMPLE: RECURSIVELY DEFINED DRAWINGS 

In this section we use two common graphical primitives to produce some interesting drawings. Each of 
these drawings is defined by applying a modification rule to a line or a filled triangle, breaking it into 
smaller pieces so the rule can be used to recursively modify each piece in the same manner. As the pieces 
become smaller and smaller, an intriguing picture emerges. 


C Curve 

A fine by itself is a first-order C curve, denoted by C 0 (see Fig. 3-29). The modification rule for 
constructing successive generations of the C curve is to replace a line by two shorter, equal-length lines 
joining each other at a 90° angle, with the original line and the two new lines forming a right-angled 
triangle. See Fig. 3-29 for C ls C 2 , C 3 , C 4 , and C 5 . 



Presume that the following call to the graphics library causes a fine to be drawn from (x, , y x ) to (x 2 , y 2 ) 
using the system’s current color: 

linefv, ,V!, x 2 ,v 2 ) 
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We can describe a pseudo-code procedure that generates C„ : 

C-curve (float x, v, len, alpha; int n) 

{ 

if (k > 0) { 

len = len/sqrt(2.0); 

C-curve(x, y, len, alpha + 45, n - 1); 
x = x + len*cos(alpha + 45); 
y =y + len*sin(alpha + 45); 

C-curve(.Y, y, len, alpha — 45, n — 1); 

} else 

line(x, v, x + len*cos(alpha), v + len*sin(alpha)); 

} 

where x and y are the coordinates of the starting point of a line, len the length of the line, alpha the angle 
(in degrees) between the line and the .v axis, and n the number of recursive applications of the modification 
rule that is necessary to produce C„. If n — 0, no modification is done and the line itself is drawn. 
Otherwise, two properly shortened lines, with one rotated counter-clockwise by 45 r and the other 
clockwise by 45° from the current line position, are generated (representing one application of the 
modification rule), each of which is the basis of the remaining n — 1 steps of recursive construction. 


The Koch Curve 

As in the case of the C curve, a line by itself is a first-order Koch curve, denoted by K 0 (see Fig. 3-30). 
The modification rule for constructing successive generations of the Koch curve is to divide a line into 
three equal segments and replace the middle segment with two lines of the same length (the replaced 
segment and the two added lines fonn an equilateral triangle). See Fig. 3-30 for K), K 2 , and AT,. 

K g K , K 2 A'j 



Fig. 3-30 Successive generations of the Koch curve. 


The Sicrpinski Gasket 

This time our graphical primitive is a filled triangle, denoted by S 0 (see Fig. 3-31). The modification 
rule for constructing successive generations of the Sierpinski gasket is to take out the area defined by the 
lines connecting the midpoint of the edges of a filled triangle, resulting in three smaller ones that are 
similar to the original. See Fig. 3-31 for 5\, S 2 , and .S', . 
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Fig. 3-31 Successive generations of the Sierpinski gasket. 


Solved Problems 


3.1 The endpoints of a given line are (0, 0) and (6, 1 8). Compute each value of y as x steps from 0 to 6 and plot the 
results. 

SOLUTION 

An equation for the line was not given. Therefore, the equation of the line must be found. The equation of 
the line (v' = mx + b) is found as follows. First the slope is found: 

Ay y 2 — .V] 18 — 0 18 
Ax x 2 — x t 6 — 0 6 

Next, the y intercept b is found by plugging V] and x t into the equation y = 3x + b: 0 = 3(0) + b. Therefore, 
b = 0, so the equation for the line is y = 3x (see Fig. 3-32). 


3.2 What steps are required to plot a line whose slope is between 0° and 45° using the slope-intercept 
equation? 

SOLUTION 

1 . C ompute dx: dx = x 2 — x, . 

2. Compute dv: dy = y 2 —.Vi- 

3. Compute m: m = dy/dx. 

4. Compute b: b =yj — m x x, . 

5. Set (x,y) equal to the lower left-hand endpoint and set x cn(1 equal to the largest value of.v. If dx < 0, then 
x =x 2 ,y = y 2 , and x end = x t . If dx > 0, then x = x t , y = V| , and x end = x 2 . 

6. Test to determine whether the entire line has been drawn. If x > x elllt , stop. 

7. Plot a point at the current (x, y) coordinates. 

8. Increment x: x = x + I. 

9. Compute the next value of y from the equation y = mx + b. 

10. Go to step 6. 

3.3 Use pseudo-code to describe the steps that are required to plot a line whose slope is between 45° 
and -45° (i.e., \m\ > 1) using the slope-intercept equation. 
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SOLUTION 

Presume y, <>'2 tor the two endpoints (xj.yi ) and (x 2 ,y 2 )' 
int x = x lt y =y\, 

float Xf, m = (y 2 — V\)/(x 2 - x,), b=y , — mx x ; 
setPixel (x, y); 
while (.v < v 2 ) { 
v++; 

Xf = Cv - b)/m; 
x = Floor(xy 4- 0.5); 
setPixel(x. v); 

} 


3.4 Use pseudo-code to describe the DDA algorithm for scan-converting a line whose slope is between 
-45 rj and 45° (i.e., |m| < 1). 

SOLUTION 

Presume x, < x 2 for the two endpoints (xj,_Vi) and (x 2 , y 2 ): 
int x = x, , y; 

float y f = y , , m = (y 2 — y, )/(x 2 - x,); 
while (x <=x 2 ) { 
y = l ; loor(y ; - + 0.5); 
setPixel(x,y); 
x++: 

V/ — \y + m; 

} 
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3.5 Use pseudo-code to describe the DDA algorithm for scan-converting a line whose slope is between 
45° and —45° (i.e., \m\ > 1). 

SOLUTION 

Presume Vi <y 2 for the two endpoints (x, , v'i ) and (x 2 ,y 2 ): 
int x, y = V] ; 

float x f = x,, m inv = (x 2 - x,)/( v 2 - Vj ); 
while fv <=y 2 ) f 
x = FloorU^- + 0.5); 
setPixel(x,.v); 
x f =x f + m mv ; 
v++; 

> 


3.6 What steps are required to plot a line whose slope is between 0° and 45° using Bresenham’s 
method? 

SOLUTION 

1. Compute the initial values: 

dx = x 2 — xj Inc 2 = 2(dy — dx) 

dy = v 2 — V) d = 7«C] — dx 

lnc\ — 2dy 

2. Set (x, v) equal to the lower left-hand endpoint and x end equal to the largest value of x. If dx < 0, then 
x = x 2 , v = v 2 , x cnd = x, . If dx > 0, then x = x,, v = v, , x end = x 2 . 

3. Plot a point at the current (x, v) coordinates. 

4. Test to see whether the entire line has been drawn. If x = x md , stop. 

5. Compute the location of the next pixel. If d < 0, then d = d + Inc l . If d > 0, then d — d + Iitc 2 , and 
then v = y+ 1. 

6. Increment x: x = x -1- 1 . 

7. Plot a point at the current (x, v) coordinates. 

8. Go to step 4. 


3.7 Indicate which raster locations would be chosen by Bresenham’s algorithm when scan-converting a 
line from pixel coordinate (1, 1) to pixel coordinate (8,5). 

SOLUTION 

First, the starting values must be found. In this case 

dx = x 2 — x, = 8 — 1 = 7 dy = y 2 — V\ = 5—1=4 

Therefore: 

Inc x = 2dy = 2x4 = 8 
Inc 2 = 2 (dy — dx) = 2 x (4 — 7) = — 6 
d = Inc i — dx = 8 — 7 = I 

The following table indicates the values computed by the algorithm (see also Fig. 3-33). 
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Fig. 3-33 


3.8 In the derivation of Bresenham’s line algorithm we have used s and t to measure the closeness of 
pixels S and T to the true line. However, s and t are only distances in the y direction. They are not 
really distances between a point to a line as defined in geometry. Can we be sure that, when s — t, 
the two pixels S and T are truly equally far away from the true line (hence we can choose either one 
to approximate the line)? 

SOLUTION 

As we can see in Fig. 3-34, when 5 = t the true line intersects the vertical line connecting S and T at 
midpoint. The true distance from S to the line is dS and that from T to the line is dJ. Since the two right- 
angled triangles are totally equal (they have equal angles and one pair of equal edges s and t), we get d S = dJ. 



3.9 Modify the description of Bresenham’s line algorithm in tire text to set all pixels from inside the 
loop structure. 

SOLUTION 1 

int x = V] , v = y \ ; 

int dx = Xj — Xj , dy = y 2 — yj , dl = 2 (dy — dx), dS = 2 dy; 
int d = 2 dy — dx: 
while (x <= x r 2 ) { 
setPixel(A-,_v); 

*++; 

if (d < 0) 
d = d + dS; 
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else { 

V++; 

d = d + </T; 

} 


SOLUTION 2 

int x =x' t - I ,y = v\; 

int dx = x 2 — Jtj, dy = >2 — Vj, dT = 2 (dy — dx), dS = 2 dy; 
int d = —dx; 
while (x < X2) { 
a++; 

if (d < 0) 
d = d + r/S; 
else { 
v++; 

d = d + dT, 

} 

setPixel(x,_v); 


3.10 What steps are required to generate a circle using the polynomial method? 

SOLUTION 

1 . Set the initial variables: r = circle radius; (/;, k) = coordinates of the circle center; x = 0; i = step size; 
*end = rjsfl. 

2. Test to determine whether the entire circle has been scan-converted. If a > stop. 

3. Compute the value of the y coordinate, where y = -Jr 1 — a 2 . 

4. Plot the eight points, found by symmetry with respect to the center (h, k), at the cunent (x, v) coordinates: 


Plot(A + h,y -|- k) 
Plot(y + /;. a + k) 
Plot( — v + h. a 4- A - ) 
Plot( — a + h.y + k) 


Plot( —a + h,—y + k ) 
Plot(— v + h. —a + A) 
Plot(y + h. —x + k) 
Plot(x + h, —v + A) 


5. Increment x: x = x + i. 

6. Go to step 2. 


3.11 What steps are required to scan-convert a circle using the trigonometric method? 

SOLUTION 

1 . Set the initial variables: r = circle radius; (h, A) = coordinates of the circle center; / = step size; 
$ end = jt/4 radians = 45°; 6 = 0. 

2. Test to detennine whether the entire circle has been scan-converted. If 0 > 0 end , stop. 

3. Compute the value of the a and y coordinates: 


x = r cos (6) y = r sin(0) 
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4. Plot the eight points, found by symmetry with respect to the center (/;, A), at the current (x, y) coordinates: 


Plot(x + h,y + k) 
Plot(y + h.x 4- k) 
Plot(— y 4- A, x + k) 
Plot(— x 4 h,y + k) 


Plot(— x 4- h, —y 4- A) 
Plot(— v 4- A, — x 4- A) 
Plotfy + h, -x + k) 
Plotfx + h, -v 4- k) 


5. Increment 6: 6 = 6 + i. 

6. Go to step 2. 


3.12 


What steps are required to scan-convert a circle using Bresenham’s algorithm? 


SOLUTION 

1 . Set the initial values of the variables: (h, k) = coordinates of circle center; x = 0; v = circle radius r; and 
d = 3 - 2r. 

2. Test to detennine whether the entire circle has been scan-converted. If x > v, stop. 

3. Plot the eight points, found by symmetry with respect to the center (h, k), at the current (x, y) coordinates: 


Plot(x 4- h, y + A) 
PIot(y + h,x + k) 
Plot) — y + h,x + k) 
Plot( — x + h, y -I- k) 


Plot(— x 4- h , — y + A) 
Plot(— y 4- A, — x + A) 
Plot(y 4- A, —x 4- A) 
Plot(x 4- h, —y -I- A) 


4. Compute the location of the next pixel. If d < 0, then d = d 4-4x4- 6 and x = x4- 1. If d > 0, then 
d = d 4- 4(x — y) 4- 10, x = x + 1, and y = y — 1 . 

5. Go to step 2. 


3.13 When eight -way symmetry is used to obtain a full circle from pixel coordinates generated for the 0° 
to 45° or the 90° to 45° octant, certain pixels are set or plotted twice. This phenomenon is 
sometimes referred to as overstrike. Identify where overstrike occurs. 

SOLUTION 

At locations resulted from the initial coordinates (r, 0) or (0, r) since (0, r) = (—0, r), (0, — r) = (—0. — r), 
(r. 0) = (r, -0), and (-r. 0) = (-r. -0). 

In a ddition, if the last generated pixel is on the diagonal line at (a r, a r) where a approximates 
1 / v /(2. 0) = 0.7071, then overstrike also occurs at (ar, a r), (— cur. a r), (ocr. —or), and (— ar, — ar). 


3.14 Is overstrike harmful besides wasting time? 

SOLUTION 

It is often hannless since resetting a pixel with the same value does not really change the image in the 
frame buffer. However, if pixel values are sent out directly, for example, to control the exposure of a 
photographic medium, such as a slide or a negative, then overstrike amounts to double exposure at locations 
where it occurred. 

Furthermore, if we set pixels using their complementary colors, then oversuike would leave them 
unchanged, since complementing a color twice simply yields the color itself. 


3.15 When scan-converting a curve using the polynomial method (see Probs. 3.10, 3.20, and 3.25) or the 
trigonometric method (see Probs. 3.11, 3.21, and 3.24), a step size i is used to compute successive 
points on the true curve. These points are then mapped to the image space. What happens if / is too 
large? What happens if it is too small? 
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SOLUTION 

If / is too large, the computed points will be relatively far from each other and the corresponding pixels 
will not fonn a continuous curve. 

If i is too small, computed points will be so close to each other that two or more adjacent ones will be 
mapped to the same pixel, resulting in overstrike. 

Note that it is not always possible to find a single step size for a specific scan-conversion task that yields a 
continuous curve without overstrike. In such cases we may take an adaptive approach in which adjustments 
are made to step size during scan-conversion based on points that have already been mapped to the image 
space. For example, if two consecutively computed points have been mapped to two pixels that are not 
connected to each other, then an additional point between the two points may be computed using half the step 
size. 

3.16 Will the following description of Bresenham’s circle algorithm and the one in the text produce the 
same results? 

int x = 0, v = r, d = 3 - 2r\ 

setPixel(x.y); 

while ( x < v) { 
if (d < 0) 
d = cl + 4x -I- 6; 
else { 

d = cl + 4(x — v) -F 1 0; 

y — ; 

) 

x++; 

setPixeljx, v); 

I 

SOLUTION 

Let A be the correct set of pixels chosen by Bresenham’s circle algorithm. Both versions produce A w'hen 
the rightmost pixel in A is on the diagonal line x = v. However, when the coordinates of the rightmost pixel in 
A satisfies x = _v — 1 , only the version in the text stops properly. This version will produce one additional pixel 
beyond the 90° to 45 c octant. This extra pixel mirrors the rightmost pixel in A with respect to the diagonal 
line. 

3.17 In the derivation of Bresenham’s circle algorithm we have used a decision variable d t — 
Z)(T) + £>(S) to help choose between pixels S and T. However, function D as defined in the text 
is not a true measure of the distance from the center of a pixel to the true circle. Show that when 
dj — 0 the two pixels S and T are not really equally far away from the true circle. 

SOLUTION 

Let cIS be the actual distance from S to the true circle and (IT be the actual distance from T to the true 
circle (see Fig. 3-35). Also substitute x for x, + 1 and.y for y, in the formula for d, to make the following proof 
easier to read: 

d i = 2X 2 +y 2 + (y- I) 2 - 2r = 0 
Since (r + cfT) 2 = x 2 + v 2 and (r — r/S) 2 = x 2 + (y — 1 ) 2 we have 

2rdT -I- (IT 1 = x 2 -f v 2 — r 2 and — 2n/S + dS 1 = x 2 + ( v — 1 ) 2 — r 2 . 

Flence 

2rdJ + clT 2 - 2rdS + dS 2 = 0 
dT(2r + dT) = dS(2r — dS) 

Since dT/dS = (2r — dS)/(2r + dT) < 1, we have dT < dS. This means that, when <(, = 0, pixel T is 
actually closer to the true circle than pixel S. 
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3.18 Write a description of the midpoint circle algorithm in which decision parameter p is updated using 
x i+i and » + i instead of x ; and y,. 

SOLUTION 


int x = 0, v = r, p = 1 — r: 
while (.v <=y) { 
setPixel(x, y); 
x++; 

if ( p < 0) 
p =p + 2x+ 1 ; 
else { 
y — ; 

p=p + 2(x-y) + 1; 

} 

1 


3.19 Will the following description of the midpoint circle algorithm and the one in Prob. 3.18 produce 
the same results? 

int x = 0, y = r, p = 1 — r; 
setPixel(.v,y); 
while (x < y) { 
x++; 

if (p < 0) 
p =p + 2x+ 1: 
else { 
y — ; 

p = p + 2(x — y) + 1 ; 

} 

setPixel(x, y); 

I 

SOLUTION 

Similar to the solution for Prob. 3.16. Only the version in Prob. 3.18 produces the correct set of pixels 
under all circumstances. 
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3.20 What steps are required to generate an ellipse using the polynomial method? 

SOLUTION 

1 . Set the initial variables: a = length of major axis; b = length of minor axis; (/;, k) = coordinates of 
ellipse center; x = 0; i = step size; .v cnd = a. 

2. Test to detennine whether the entire ellipse has been scan-converted. If .v > x cmi , stop. 

3. Compute the value of the y coordinate: 


v = bj 1 - -r 


4. Plot the four points, found by symmetry, at the current (x, y) coordinates: 


Plot(x 4- It, y 4- k) Plot ( — x 4- h, — y 4- k) 

Plot( — x + h, y 4- k) Plot(x 4- h, — v + k) 


5. Increment x: x = x + i. 

6. Go to step 2. 


3.21 What steps are required to scan-convert an ellipse using the trigonometric method? 

SOLUTION 

1. Set the initial variables: a — length of major axis; b — length of minor axis; (It, k) — coordinates of 
ellipse center; i — counter step size; 6 etuJ = n/2\ 9 = 0. 

2. Test to determine whether the entire ellipse has been scan-converted. If 9 > 9 enA , stop. 

3. Compute the values of the x and y coordinates: 

x = a cos (0) y = b sin(fl) 

4. Plot the four points, found by symmetry, at the current (x,y) coordinates: 

Plot(x -l- h, v 4- k) Plot(— x 4- h, — y 4- k) 

Plot( —x 4- h, y 4- k) Plot(.v 4- It, — y 4- k) 

5. Increment 9: 9 = 9 + i. 

6. Go to step 2. 


3.22 When four-way symmetry is used to obtain a full ellipse from pixel coordinates generated for the 
first quadrant, does overstrike occur? Where? 

SOLUTION 

Overstrike occurs at (0 ,b), (0 ,—b), (a, 0), and (— a,0) since (0, b) = (— 0, b), (0, — b) = (— 0, — b), 
(a, 0) = (a, —0), and (—a, 0) = (—a, —0). 


3.23 In the midpoint ellipse algorithm we have used only the coordinates (x k ,y k ) of the last pixel chosen 
for part 1 of the curve to compute the initial value c/, of the decision parameter q. for part 2. Can we 
also make use of the last value of the decision parameter pp. 
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SOLUTION 

The last computed value of the decision parameter p, for part 1 of the curve is 

Pk =f( x k + '-V* -|) = b \ x k + l) 2 + a 2 (vjt - i) 2 - a 2 b 2 
= h 2 (4 + 2x k + 1 ) + a 2 (/ k - y k + j) - a 2 b 2 

Since 

9\ =f( x k +1-^ - 0 = b 2 (x k + \ ) 2 + a 2 (v k - 1 ) 2 - a 2 b 2 
= 6‘(x| + •**+?) + tf2 ( v I — 2y* + 1 ) — a~b~ 

We have 

?i = Pk - b Hxk +|) - a 2 (v k - f) 

3.24 Wliat steps are required to scan-convert an arc using the trigonometric method? 

SOLUTION 

1 . Set the initial variables: a — major axis: b -- minor axis; (/;, k) = coordinates of arc center; i = step size; 
6 = starting angle; 0 } = ending angle. 

2. Test to determine whether the entire arc has been scan-converted. If 8 > 0,, stop. 

3. Compute the values of the x and y coordinates; 

x — a cos(G) + h v = a sin(fl) -1- k 

4. Plot the points at the current (x, v) coordinates; Plot(x', v). 

5. Increment 8: 8 —8 + i. 

6. Go to step 2. 

(Note: for the arc of a circle a = b = circle radius r.) 

3.25 Wliat steps are required to generate an arc of a circle using the polynomial method? 

SOLUTION 

1. Set the initial variables: r = radius: (h, k) = coordinates of arc center; x =x coordinate of start of arc; 
x, = x coordinate of end of arc; / = counter step size. 

2. Test to detennine whether the entire arc has been scan-converted. If x > x,, stop. 

3. Compute the value of the y coordinate: 

y = s/r 1 — x 1 

4. Plot at the current (x, yj coordinates: 

Plot(x + h.y + k) 

5. Increment x: x = x 4- i. 

6. Go to step 2. 

3.26 Wliat steps are required to scan-convert a rectangle whose sides are parallel to the coordinate axes? 

SOLUTION 

1. Set initial variables: (Xj, Vj) = coordinates of first point specified: (x 2 , y 2 ) = coordinates of second point 
specified. 
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2. Plot the rectangle: 

Plot^.y,) to (jc 2 , >> , ) Plot(x 2 ,v 2 ) to (x, , v 2 ) 

PIot(x 2 . V| ) to (x 2 , v 2 ) Plot(x,. v 2 ) to (x,, Vi) 

3.27 How would a Hood-fill algorithm fill the region shown in Fig. 3-36, using the 8-connected definition 

for region pixels? 

SOLUTION 

1. Assume that a seed is given at coordinate 3, 3. The flood-fill algorithm will inspect the eight points 

surrounding the seed (4, 4; 3, 4; 2, 4; 2, 3; 2, 2: 3, 2; 4, 2; 4, 3). Since all the points surrounding the seed 

have the regions original color, each point will be filled (see Fig. 3-37). 

2. Each of the eight points found in step 1 becomes a new seed, and the points surrounding each new seed 
are inspected and filled. This process continues until all the points surrounding all the seeds are rid of the 
region’s original color (see Fig. 3-38). 



3.28 Write a pseudo-code procedure to implement the boundary-fill algorithm in the text in its basic 
form, using the 4-connected definition for region pixels. 

SOLUTION 

BoundaryFill (int x, v, fill_color, boundary _co lor) 

{ 

int color; 

getPixcl(x, y\ color); 

if (color != boundary _color && color != fill_color) { 
setPixelfr, v, fill_color); 

BoundarylTH(x + 1, y, fill_color, boundary _color); 

BoundaryFill(x, y 4- 1, fill_color, boundary _color); 

Boundary FilKx — 1, y, fill_color, boundary_color): 

BoimdaryFill(.v, v — 1, fill_coIor, boundary_color): 

} 

} 


3.29 Write a pseudo-code procedure to implement the flood-fill algorithm in the text in its basic form, 
using the 4-connected definition for region pixels. 
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SOLUTION 

FloodFill (int x,y, fill_color, original_color) 

{ 

int color; 

getPixel(x, v, color); 
if (color == original_color) { 
sctPixeI(x, y, fill_color); 

FloodFill(x + 1 , y, fill_color, original_color); 
FloodFiIl(x, y + 1 , fill_color, original_color); 
FloodFill(.r — 1, v, fill_color, original_color); 
FloodFill(x, v — I. fill_color, original_color); 

} 

} 


3.30 The coordinates of the vertices of a polygon are shown in Fig. 3-39. (a) Write the initial edge list for 
the polygon, (b) State which edges will be active on scan lines y = 6, 7, 8, 9, and 10. 


y 

10 - 

9 - 
8 - 

7 

6 4 


• 


r 




I • 

• £4 

Vi# #v> 


J 1 I I I I I 1 1 1 

1234 5 678 9 10 x 


Fig. 3-39 


SOLUTION 

(a) Column x contains the x coordinate of the corresponding edge's lower endpoint. Horizontal edges are not 
included. 


Edge 

■Vmin 

y max 

X 

\/m 


4 

7 

9 

0 

eI 

4 

7 

2 

0 

e 4 

7 

9 

8 

0 

e 6 

7 

9 

4 

0 
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( b ) An edge becomes active when the scan line value y equals the edge’s y mm value. The edge remains active 
until the scan line value y goes beyond the edge’s y raax value. Therefore, the active edges for y = 6, 7, 8, 
9, and 10 appears as follows. 

At v = 6, E 2 and £g. 

At y = 7, y = y niax for both edges E 2 and so they remain active. Also at v = 7, edges E A and E 6 
become active. 

At y = 8, E 2 and E % are removed from the edge list. E A and E h remain active. 

At y = 9, the active edges remain the same. At y = 10, edges E 2 and E A are removed from the edge list 
and the edge list becomes empty. 


3.31 What are the three major adverse side effects of scan conversion? 

SOLUTION 

The three major adverse effects of scan conversion are staircase appearance, unequal brightness of slanted 
lines, and the picket fence problem. 


3.32 Suppose that in 3 x 3 super sampling a pixel has three of its subpixels in a red area, three in a green 
area, and three in a blue area, what is the pixel’s overall color? 

SOLUTION 

Each of the three areas is responsible for one third of the pixel’s overall intensity value, which is (y, j). 


3.33 Write a pseudo-code procedure for generating the Koch curve K„ (after the one in the text for 
generating C„). 

SOLUTION 

Koch-curve (float x. v, len, alpha; int «) 

{ 

if (w > 0) { 
len = len/3; 

Koch-curve(x, y, len, alpha, n — 1); 
x = x + Ien*cos(alpha); 
y = y -1- len*sin(alpha); 

Koch-curve(x, y, len, alpha — 60, n — 1 ); 
x = x 4- len*cos( alpha — 60); 
y = y + len*sin( alpha — 60); 

Koch-curve(x, y, len, alpha + 60, n — I ); 

X = x + len*cos(alpha -f 60); 
y = y + len*sin(alpha + 60); 

Koch-curve(x, y, len, alpha, n — 1 ); 

} else 

line(x. y, x -I- len*cos(alpha), v + len*sin( alpha)); 

> 


3.34 Presume that the following statement produces a filled triangle with vertices at (X[ , y, ), (x 2 , y 2 ), and 
triangle(x, . y, , x 2 , y 2 , x 3 , y 3 ) 
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Write a pseudo-code procedure for generating the Sierpinski gasket S„ (after the procedure in the text for 
generating C„). 

SOLUTION 

S-Gasket (float x lt v, , x 2 , v 2 , x 3 , V 3 ; int n) 

{ 

float x I2 , v 12 , x I3 , y l3 , x 23 , y 23 ; 
if (w > 0 ) { 

■*12 = (-*1 + x 2 )/ 2 ; 

, v i2 = C>i + v 2 )/2; 

-*u =(*1 + x 3 )/ 2 : 

V13 = (v, +y 3 )/2; 

*23 = (*2 + x 3 )/ 2 ; 
y 2 3 = (v 2 +y 3 )/ 2 ; 

S-Gasket(x,, Vi,x 12 , v 12 ..t|3, v,3, « - 1); 

S-Gasket(x 12 , y 12 , x 2 , y 2 , x 23 , y 23 , n — 1); 

S-Gasket(x 13 , _Vi 3 , x 23 , y 23 , x 3 , y 3 , n — 1); 

} else 

triangle(x ls y 3 , x 2 , v 2 , x 3 , Vj); 

} 


Supplementary Problems 

3.35 Given the following equations, find the corresponding values of y for each value of x (x = 2,7, 1): (a) 
v = 4x -f- 3, ( 6 ) y = lx -I- 0, (c) y = — 3x — 4, and (d) y = — 2x + 1 . 

3.36 What steps are required to plot a line whose slope is between 45° and 90° using Bresenham's method? 

3.37 What steps are required to plot a dashed line? 

3.38 Show graphically that an ellipse has tour-way symmetry by plotting four points on the ellipse: 

x = a cos( 0 ) + h y = b sin( 0 ) + k 

where a = 2 
b= 1 
h = 0 
k = 0 

0 = n/4, 37t/4, 57t/4, 7n/4 

3.39 How must Prob. 3.21 be modified if an ellipse is to be rotated (a) n/4, ( b ) n/9, and (c) n/2 radians? 


3.40 What steps are required to scan-convert a sector using the trigonometric method? 
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3.41 What steps must be added to a fill algorithm if a region is to be filled with a pattern? 

3.42 Why is it important for the designer to remain consistent when choosing either local or global aliasing? 

3.43 What steps are required to scan-convert a polygonal area using the scan-line algorithm? 

3.44 How can we eliminate overstrike? 




Two-Dimensional 

Transformations 


Fundamental to all computer graphics system is the ability to simulate the manipulation of objects in space. 
This simulated spatial manipulation is referred to as transformation. The need for transformation arises 
when several objects, each of which is independently defined in its own coordinate system, need to be 
properly positioned into a common scene in a master coordinate system. Transformation is also useful in 
other areas of the image synthesis process (e.g. viewing transformation in Chap. 5). 

There are two complementary points of view for describing object transformation. The first is that the 
object itself is transformed relative to a stationary coordinate system or background. The mathematical 
statement of this viewpoint is described by geometric transformations applied to each point of the object. 
The second point of view holds that the object is held stationary while the coordinate system is transformed 
relative to the object. This effect is attained through the application of coordinate transformations. An 
example that helps to distinguish these two viewpoints involves the movement of an automobile against a 
scenic background. We can simulate this by moving the automobile while keeping the backdrop fixed (a 
geometric transformation). Or we can keep the car fixed while moving the backdrop scenery (a coordinate 
transformation). 

This chapter covers transformations in the plane, i.e., the two-dimensional (2D) space. We detail three 
basic transformations: translation, rotation, and scaling, along with other transformations that can be 
accomplished in tenns of a sequence of basic transformations. We describe these operations in 
mathematical form suitable for computer processing and show how they are used to achieve the ends of 

object manipulation. 


4.1 GEOMETRIC TRANSFORMATIONS 

Let us impose a coordinate system on a plane. An object Obj in the plane can be considered as a set of 
points. Every object point P lias coordinates (x, y), and so the object is the sum total of all its coordinate 
points (Fig. 4-1). If the object is moved to a new position, it can be regarded as a new object Obj'. all of 
whose coordinate points P' can be obtained from the original points P by the application of a geometric 

transformation. 
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Fig. 4-1 


Translation 

In translation , an object is displaced a given distance and direction from its original position. If the 
displacement is given by the vector v = t x l + tj, the new object point F"(x\/) can be found by applying 
the transformation T y to P(x, y) (see Fig. 4-2), 

P' = T y (P) 

where xf = .v + t x and / = y + t v . 



Fig. 4-2 


Rotation about the Origin 

In rotation , the object is rotated 9" about the origin. The convention is that the direction of rotation is 
counterclockwise if fl is a positive angle and clockwise if 9 is a negative angle (see Fig. 4-3). The 
transformation of rotation R„ is 

P' = R 6 (P) 

where x 1 = r cos (9) - ysin(0) and / = xsin(9) -fycos (9). 
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Scaling with Respect to the Origin 

Scaling is the process of expanding or compressing the dimensions of an object. Positive scaling 
constants s x and s y are used to describe changes in length with respect to the x direction and y direction, 
respectively. A scaling constant greater than one indicates an expansion of length, and less than one, 
compression of length. The scaling transformation S s . s is given by P = S,. v (P) where x = s x x: and 
/ = s v y. Notice that, after a scaling transformation is perforated, the new object is located at a different 
position relative to the origin. In fact, in a scaling transfonnation the only point that remains fixed is the 
origin. Figure 4-4 show's scaling transfonnation with scaling factors s t — 2 and s,, — 




If both scaling constants have the same value s, the scaling transfonnation is said to be homogeneous 
or uniform. Furthermore, if.v > 1, it is a magnification and for s < 1, a reduction. 


Mirror Reflection about an Axis 

If either the x or y axis is treated as a mirror, the object has a mirror image or reflection. Since the 
reflection P of an object point P is located the same distance from the mirror as P (Fig. 4-5), the mirror 
reflection transfonnation M x about the x axis is given by 

P'=M X (P) 


where x' — x and V — — v. 
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y) 


Fig. 4-5 

Similarly, the mirror reflection about the y axis is 

P' = M V (P) 

where x' = — x and / = y. 

Note that A/,. = _] and M xy = S u . The two reflection transformations are simply special cases of 

scaling. 


Inverse Geometric Transformations 

Each geometric transformation has an inverse (see App. 1) which is described by the opposite 
operation perfonned by the transformation: 

Translation: 77 1 = T_ v , or translation in the opposite direction 
Rotation: R a 1 = R_ e , or rotation in the opposite direction 
Scaling: S s ~J Sy = S 1/SxA/Sy 
Mirror reflection: Mf 1 = M x and M v 1 = M v . 

4.2 COORDINATE TRANSFORMATIONS 

Suppose that we have two coordinate systems in the plane. The first system is located at origin O and 
has coordinates axes xy. The second coordinate system is located at origin O' and has coordinate axes xV 
(Fig. 4-6). Now each point in the plane has two coordinate descriptions: (x, v) or (x',/), depending on 
which coordinate system is used. If we think of the second system .vV as arising from a transformation 
applied to the first system xy, we say that a coordinate transformation has been applied. We can describe 

'*4 


JT ' 

• P(x, y) 
(x'.jO 




Fig. 4-6 
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this transformation by determining how the (x ■',/) coordinates of a point P are related to the (x, y) 
coordinates of the same point. 


Translation 

If the xv coordinate system is displaced to a new position, where the direction and distance of the 
displacement is given by the vector v = f t l + J, the coordinates of a point in both systems are related by 
the translation transformation T y .\ 

(x\/) = f v (x,y) 

where x' — x — t x and \f — y — t v . 


Rotation about the Origin 

The xy system is rotated 0 about the origin (see Fig. 4-7). Then the coordinates of a point in both 
systems are related by the rotation transformation R e : 

(x',.v) = ^ e (x, v) 

where x' = xcos(0) + _ysin(fl) and yf = — xsin(0) + .vcos($). 



Scaling with Respect to the Origin 

Suppose that a new coordinate system is formed by leaving the origin and coordinate axes unchanged, 
but introducing different units of measurement along the x and y axes. If the new units are obtained from 
the old units by a scaling of s x along the x axis and s y along the v axis, the coordinates in the new system 
are related to coordinates in the old system through the scaling transformation S s s : 

(x',y / )-S s ^(x,y) 

where x' = (1 /s x )x and \- = (l/s v )y. Figure 4-8 shows coordinate scaling transformation using scaling 
factors s x — 2 and s v — 


Mirror Reflection about an Axis 

If the new coordinate system is obtained by reflecting the old system about either x or y axis, the 
relationship between coordinates is given by the coordinate transformations M x and M y . For reflection 
about the x axis [Fig. 4-9(n)] 


(x',J /)=M x (x,y) 
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Fig. 4-8 


where x —x and / — — y. For reflection about the y axis [Fig. 4-9(6)] 

(x'.y) = M y (x,y) 



Fig. 4-9 

Notice that the reflected coordinate system is left-handed; thus reflection changes the orientation of the 
coordinate system. Also note that M x = S, , and M v = j - 

Inverse Coordinate Transformations 

Each coordinate transformation has an inverse (see App. 1) which can be found by applying the 
opposite transformation: 

Translation: T ~ 1 = T v , translation in the opposite direction 
Rotation: R 0 1 = R _ g , rotation in the opposite direction 
Scaling: S~] = S Uj ^ l/Sy 
Mirror reflection: M x 1 = M x and M v 1 = M y 

4.3 COMPOSITE TRANSFORMATIONS 

More complex geometric and coordinate transformations can be built from the basic transformations 
described above by using the process of composition of functions (see App. 1). For example, such 
operations as rotation about a point other than the origin or reflection about lines other than the axes can be 
constructed from the basic transformations. 

EXAMPLE 1. Magnification of an object while keeping its center fixed (see Fig. 4-10). Let the geometric center be 
located at C(h,k) [Fig. 4-1 0(a)]. Choosing a magnification factor s > 1, we construct the transformation by 
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Fig. 4-10 

performing the following sequence of basic transformations: (1) translate the object so that its center coincides with 
the origin [Fig. 4-10(6)], (2) scale the object with respect to the origin [Fig. 4- 10(c)], and (3) translate the scaled object 
back to the original position [Fig. 4-10(<f)]. 

The required transformation S aC can be formed by compositions S sC = T v ■ S ss ■ T~ x where v = AI + AJ. By 
using composition, we can build more general scaling, rotation, and reflection transformations. For these transforma- 
tions, we shall use the following notations: (1) S^ yP - -scaling with respect to a fixed point P; (2) R g P rotation 
about a point P: and (3) M L — reflection about a line L. 

The matrix description of these transformations can be found in Probs. 4.4, 4.7, and 4.10. 

Matrix Description of the Basic Transformations 

The transformations of rotation, scaling, and reflection can be represented as matrix functions: 
Geometric transformations Coordinate transformations 

R _ /cos(0) — sin(0) \ r —( cos ( 0 ) sin(0)\ 

8 V sin(0) cos(0) / 8 V — sin(0) cos(0) / 
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The translation transformation cannot be expressed as a 2 x 2 matrix function. However, a certain artifice 
allows us to introduce a 3 x 3 matrix function which performs the translation transformation. 

We represent the coordinate pair (x, v) of a point P by the triple (x,y, 1). This is simply the 
homogeneous representation ofP (App. 2). Then translation in the direction v = t v I r v ,J can be expressed 
by tire matrix function 

(\ 0 t x 

T v = 0 1 t y 

Vo o i 


(.x + f x 

VV’ 

From this we extract the coordinate pair (x + y + t v ). 


Tlien 



Concatenation of Matrices 

The advantage of introducing a matrix form for translation is that we can now build complex 
transformations by multiplying the basic matrix transformations. This process is sometimes called 
concatenation of matrices and the resulting matrix is often referred to as the composite transformation 
matrix (CTM). Here, we are using the fact that the composition of matrix functions is equivalent to matrix 
multiplication (App. 1). We must be able to represent the basic transformations as 3 x 3 homogeneous 
coordinate matrices (App. 2) so as to be compatible (from the point of view of matrix multiplication) with 
the matrix of translation. This is accomplished by augmenting the 2x2 matrices with a third column 

M 

I 0 I and a third row (0 01). That is 

( a b 0 
c d 0 
0 0 1 


EXAMPLE 2. Express as a matrix (i.e., CTM) the transformation which magnifies an object about its center 
C(h, k). From Example 1 , the required transformation S s C can be written as 

Ss,c = T v -S ! ' S -T~' 


/I 

0 h\ 

( S 

0 

0\ 

/I 0 -h\ 

I s 

0 

—sh + if 

0 

1 k 

0 

s 

0 

0 1 -k 

= 0 

s 

— sk 4“ k 

\0 

0 1 / 

\0 

0 

1/ 

\0 0 1 J 

\o 

0 

1 ) 


Caution on Matrix Notations 

The reader should be alerted to the fact that, within the field of computer graphics, there are two 
different matrix notations that are used. This book represents points by column vectors and applies 
transformations by left-multiplying by the transformation matrix. We have chosen this approach because it 
is the standard used in mathematics and computer science texts. The other notation represents points by 
row vectors and applies transformations by right-multiplying by the transformation matrix. It is used in 
much of the computer graphics literature. 
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To change from one notational style to another, it is necessary to take the transpose of the matrices that 
appear in any expression. For example, translation of point (x, y) in the direction v = f T I 4- f v J can also be 
expressed as 

1 0 0\ 

0 1 0 ) =(x + t x y + t y 1) 

fr K. \) 

EXAMPLE 2 continued. Using the row-vector notation, we have 

( 1 0 0\/s 0 OWl 0 0\ / 5 0 0\ 

0 10 0 s 0 0 10 =1 0 s 0 

—h -k 1 / \0 0 1 / V* k 1/ \-sh+h -sk + k 1/ 


4.4 INSTANCE TRANSFORMATIONS 

Quite often a picture or design is composed of many objects used several times each. In turn, these 
objects may also be composed of other symbols and objects. We suppose that each object is defined, 
independently of the picture, in its own coordinate system. We wish to place these objects together to form 
the picture or at least part of the picture, called a subpicture. We can accomplish this by defining a 
transformation of coordinates, called an instance transformation , which converts object coordinates to 
picture coordinates so as to place or create an instance of the object in the picture coordinate system. 

The instance transformation A' pjcture ob j ect is formed as a composition or concatenation of scaling, 
rotation, and translation operations, usually performed in tins order (although any order can be used): 

^picture.object * ^d.P ' ^a.b.P 

With the use of different instance transformations, the same object can be placed in different 
positions, sizes, and orientations within a subpicture. For instance. Fig. 4-1 1 (a) is placed in the picture 
coordinate system of Fig. 4-1 1(b) by using the instance transformations 7V picture _ oblect . 


(x y 1) 



( a ) Object coordinate system. 



(b) Picture coordinate system. 


Fig. 4-11 


Nested Instances and Multilevel Structures 

A subpicture or picture may exhibit a multilevel or nested structure by being composed of objects 
which are, in turn, composed of still other objects, and so on. Separate instance transformations must then 
be applied in principle, at each level of the picture structure for each picture component. 

EXAMPLE 3. A picture of an apple tree contains branches, and an apple hangs on each branch. Suppose that each 
branch and apple is described in its own coordinate system [Figs. 4-12(a) and 4-12(6)]. Then a subpicture call to place 
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-^branch 

k 


■^branch 


(b) Branch. 


^pklure 



(c) Picture coordinate system. 
Fig. 4-12 


an instance of this branch in the picture of the tree requires an additional subpicture call to place an instance of the 
apple into the branch coordinate system. 

We can perform each instance transformation separately, i.e., instance the apple in the branch coordinate system 
and then instance both branch and apple from the branch coordinate system to the picture coordinate system. 
However, it is much more efficient to transform the apple directly into picture coordinates [see Fig. 4-1 2(c)]. This is 
accomplished by defining the composite transformation matrix U pjctureol , |ecl to be the composition of the nested 
instance transformations from apple coordinates to branch coordinates and then from branch coordinates to picture 
coordinates: 


^■picture, apple ^picture, branch ' ^branch, apple 


Since the branch subpicture is only one level below the picture 


^"picture, branch ^picture, branch 


Solved Problems 

4.1 Derive the transformation that rotates an object point & about the origin. Write the matrix 
representation for this rotation. 

SOLUTION 

Refer to Fig. 4-13. Definition of the trigonometric functions sin and cos yields 

x 1 = r cos(0 -F (j>) y' = r sin(0 + <f>) 

and 

x = r cos <f> y = r sin <f> 

Using trigonometric identities, we obtain 

r cos (6 + 4>) = r( cos 0 cos (f> — sin 0 sin r/>) = .r cos 9 — y sin 9 
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and 


r sin(0 + <j>) = risin 0 cos (j> + cos 9 sin <f>) = xsin 9 — v cos 0 


or 


x' = x cos 6 — y sin 6 


y = x sin# + y cos 0 


Writing P = 



/ cos0 — sin$\ 
y sin 6 cos 6 ) 


we can now write P = R g ■ P. 



4.2 (a) Find the matrix that represents rotation of an object by 30° about the origin. 

(b) What are the new coordinates of the point P( 2, —4) after the rotation? 

SOLUTION 

(a) From Prob. 4.1: 


/ cos 30 — sin 30 \ 

(V3 

2 

1 \ 

2 

\ sin 30 ° cos 30 c ' / 

1 



\ 2 

t) 


(b) So the new coordinates can be found by multiplying: 


If 


V 2 




4.3 Describe the transformation that rotates an object point, Q(x, v), 0 about a fixed center of rotation 
P(h, k) (Fig. 4-14). 

SOLUTION 

We determine the transformation R, l r in three steps: ( 1 ) translate so that the center of rotation P is at the 
origin, (2) perfonn a rotation of 9 degrees about the origin, and (3) translate P back to (/;. k). 

Using v = hi 4- kJ as the translation vector, we build Rg P by composition of transformations: 


Rft.tr — T\ ' Rft ' T-y 
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4.4 


4.5 


Write the general form of the matrix for rotation about a point P(h, k). 

SOLUTION 

Following Prob. 4.3, we write R ep = T v ■ R e ■ T_ y , where v = hi + kJ. Using the 3 x 3 homogeneous 
coordinate fonn for the rotation and translation matrices, w'e have 


o,p 



\ 

/I 

0 

-n 


0 

1 

-* 

/ 

v0 

0 

• ) 


sm(G) cos(O) 0 
\ 0 0 

( cos(fl) — sin(O) [— It cos(O) + k sin(fl) + h \ ' 

= sin (0) cos (9) [— hsm(f)) — kcos(0) + k] 

^00 1 

Perform a 45° rotation of triangle /1((), 0), 5(1, 1), C(5,2) {a) about the origin and ( b ) about 

P(-l,-l). 

SOLUTION 

We represent the triangle by a matrix fonned from the homogeneous coordinates of the vertices: 



(a) The matrix of rotation is 





(± 2 

s/2 

/ cos 45° 

— sin 45 ,: ' 

o\ 

2 

2 

R 4 i = [ sin 45° 

cos 45° 

° = 

V2 

V2 

\ 0 

0 

1/ 

2 

\ o 

2 

0 


-V- --TT- 0 

0 

1 / 


So the coordinates A'B'C of the rotated triangle ABC can be found as 


[A'B'C'\ = S 45 • [/ISC] = 


Q \ 

2 2 

4 £ 0 

2 2 

\ 0 0 I / 

Thus A’ = (0. 0), S' = (0, V2). and C' = (§ sfl,\ n/2). 


A' S' C 



0 0 


0 v/2 

Vl 1 


3n/2 \ 

2 

7 n/2 
2 

I / 
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(b) From Prob. 4.4, the rotation matrix is given by R^p = T y ■ R 45 . • T_ v , where v = — I — J. So 


MS ',/* ’ 


\ 0 -I 

0 1 -I 

.0 0 I 


/V2 _s/2 \ 

2 2 
V2 V2 „ 


'] 0 1 

0 1 1 

,0 0 1 


/y/2 _^2 

2 2 

V2 \fl 


(>/ 2 - 1 ) 


Now 


\ 0 0 

.7 


\ 0 0 



(s/ 2 

V2 

-1 ^ 




2 

2 


(° 1 

5\ 

,/> • [ABC] = 

V2 

2 

Vl 

2 

(V2-1) 


2 

1/ 


V o 

0 

1 ) 




/ -1 -1 (fv^-lA 

(s/2-1) (2v/2— I) (|V2 - 1) 

\ 1 1 1 / 


/ 


So = (-1, - 1), £' = (-!, 2>/2- 1), and C 1 =(W 2- 1,|V2- 1). 


4.6 Find the transfonnation that scales (with respect to the origin) by (a) a units in the X direction, ( b ) b 
units in the Y direction, and (c) simultaneously a units in the X direction and b units in the Y 
direction. 

SOLUTION 

(a) The scaling transfonnation applied to a point P(x,y ) produces the point (ax, y). We can write this in 
matrix form as S a , • P, or 


(o :)(;)=(:) 

( b ) As in part (a), the required transformation can be written in matrix form as S i ;> • P. So 

0 1)C)-U) 

(c) Scaling in both directions is described by the transfonnation x = ax and / = by. Writing this in matrix 
fonn as S a b ■ P, we have 


(o °)(;M;) 


4.7 Write the general form of a scaling matrix with respect to a fixed point P(h. k). 

SOLUTION 


Following the same general procedure as in Probs. 4.3 and 4.4, we write the required transformation with 
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v = /it + AJ as 


/> 

0 

h\ 

fa 0 0\ 

/I 

0 

0 

1 

k 

0 1 

b 0 

0 

1 

Vo 

0 

1/ 

\o 0 \) 

\0 

0 

( a 

0 

—ah + h \ 




0 

b 

-bk+k 




\0 

0 


1 ) 





-h 

-k 

1 


4.8 Magnify the triangle with vertices A(0, 0), B{ 1 , 1), and C(5, 2) to twice its size while keeping 
C(5, 2) fixed. 

SOLUTION 


From Prob. 4.7, we can write the required transformation with v = 51 + 2J as 


Si 


.2 ,C ' 


T, ■ S 2t 2 

T_, 

V 








/I 0 

5 ) 

( 2 

0 

0\ 

( 1 0 


I 2 

0 

—5 \ 


2 

p 

2 

0 


-2 

= 0 

2 

—2 

Vo 0 

1 J 

Vo 

0 

'J 

Vo 0 

w 

Vo 

0 

1/ 


Representing a point P with coordinates (x,.v) by the column vector 


I, we have 



So A ’ = (—5, —2), B' = (—3, 0), and C' = (5, 2). Note that, since the triangle ABC is completely detennined 
by its vertices, we could have saved much writing by representing the vertices using a 3 x 3 matrix 


[ABC\ = 


5 

2 

1 


and applying S 12 c to this. So 


S 2 X c-[ABC\ = 




-3 5\ 

0 2= [ A'B'C] 


4.9 Describe the transformation M L which reflects an object about a line L. 

SOLUTION 

Let line L in Fig. 4-15 have a y intercept (0, b) and an angle of inclination 0 (with respect to the x axis). 
We reduce the description to known transformations: 
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1 . Translate the intersection point B to the origin. 

2. Rotate by — 0° so that line L aligns with the .v axis. 

3. Mirror-reflect about the .v axis. 

4. Rotate back by 0°. 

5. Translate B back to (0, b). 

In transformation notation, we have 

M l = T v - Rfj ■ M x R_ e ■ T_ y 


where v = bJ. 


P' 



4.10 Find the fonn of the matrix for reflection about a line L with slope m and y intercept (0, b). 

SOLUTION 

Following Prob. 4.9 and applying the fact that the angle of inclination of a line is related to its slope m by 
the equation tan(0) = m, we have with v - hi, 


M l — T y ■ R () ■ M x ■ R_ e ■ T_y 


0 0\ 

< cos(0) 

— sin(0) 

°\ 

/I 

0 0\ 

1 

cos(0) 

sin(0) 


/. 0 

°\ 

1 4 

sin(0) 

cos (0) 

0 

0 

-1 0 


— sin(0) 

cos(0) 

° 

° ' 

-b 

0 1/ 

l o 

0 

1> 

\0 

0 l) 

V 

0 

0 

l 

\0 0 

'/ 


Now if tan(0) = m, standard trigonometry yields sin(0) = m/s/m 2 + 1 and cos(0) = l/y/m 2 + 1. Substitut- 
ing these values for sin(0) and cos(0) after matrix multiplication, we have 


/ I - m 2 2m 


M, = 


m 2 - 1 


2m 

m 2 + 1 m 2 + 1 
0 0 


—2hm ^ 
m 2 4- 1 
2b 

m 2 + 1 

1 


4.11 Reflect the diamond-shaped polygon whose vertices are A( — 1, 0), B( 0, —2), C(l, 0), and £>((), 2) 
about (a) die hoiizontal line y — 2, (b) the vertical line x — 2, and (c) the line y — x + 2. 
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SOLUTION 

We represent the vertices of the polygon by the homogeneous coordinate matrix 

( -1 0 I 0\ 

0 -2 0 2 

I III / 

From Prob. 4.9, the reflection matrix can be written as 

M l = T v - Rf, M x • R_0 ■ T_ y 

(a) The line y = 2 has y intercept (0,2) and makes an angle of 0 C with the x axis. So with 9 = 0 and v = 2 J, 
the transfonnation matrix is 

( 1 0 0\/l 0 0\/l 0 OWl 0 0\/I 0 0 \ /I 0 0\ 

0 1 2 0 1 Oj O -1 0 0 1 0 0 I —2 1 = I 0 -1 4 

0 0 l/\0 0 l/\0 0 1 / \0 0 ]/\0 0 1 / \0 0 1/ 

This same matrix could have been obtained directly by using the results of Prob. 4. 10 with slope m = 0 
and y intercept b = 2. To reflect the polygon, we set 

A' S' C D' 

(\ 0 0\/-l 0 1 0\ /-I 0 1 0\ 

M l ■ V = I 0 -I 4 0 -2 0 2 1 = 1 4 6 4 2 

\0 0 1 / \ 1 111/ \ 1 1 1 1 / 

Converting from homogeneous coordinates. A' = (— 1 , 4), B' — (0, 6), C' — ( 1 , 4), and D' = (0, 2). 

(b) The vertical line x = 2 has no y intercept and an infinite slope! We can use M v , reflection about the y 
axis, to write the desired reflection by ( 1 ) translating the given line two units over to the y axis, (2) reflect 
about the y axis, and (3) translate back two units. So with v = 21, 

M L = Ty ■ M y ■ T_y 

1 0 2\ / — 1 0 0 Wl 0 —2 \ /-I 0 4\ 

010 010 01 0 = 010 

0 0 1 / \ 0 0 1 /\0 0 1/ \ 0 0 I y 

Finally 

/-I 0 4\ / -I 0 1 0 \ / 5 4 3 4\ 

M l ■ V = 0 1 0 ( 0 -2 0 2 = I 0 -2 0 2 

V o 0 I / \ 1 111/ V 1 1]l / 

or A' = (5, 0), S' = (4, -2), C' = (3, 0), and D' = (4, 2). 

(c) The line y = A' -F 2 has slope 1 and ay intercept (0, 2). From Prob. 4.10, with m = 1 and b = 2, we find 

/0 1 -2 
M l = I 0 2 

\0 0 1 

The required coordinates A', S', C , and D' can now be found. 

( 0 1 -2W-1 0 1 0\ / — 2 —4 —2 0\ 

10 2 0 -2 02 =1 1 2 32 

00 !/\l l11 / V 1 1 1 1 / 

So A' = (-2, I), S' = (-4, 2), C = (-2. 3), and D' = (0, 2). 



4.12 


The matrix 



defines a transfonnation called a simultaneous sheaving or shearing for short. 


The special case when b = 0 is called shearing in the x direction. When a - 0, we have shearing in 
the y direction. Illustrate the effect of these shearing transformations on the square /!((), 0), B( 1 , 0), 
C(l, 1), and D(0, 1) when a — 2 and b — 3. 
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SOLUTION 

Figure 4-16(a) shows the original square. Fig. 4-1 6(A) shows shearing in the a- direction. Fig. 4-16(<?) 
shows shearing in the y direction, and Fig. 4-1 6(d) shows shearing in both directions. 


y 


D 


C(l, 1) 


4 — i — i — 1—1 — I 

A B 


(a) 


x 





Fig. 4-16 


4.13 An observer standing at the origin sees a point P(l, I). If the point is translated one unit in the 
direction v = I, its new coordinate position is P'(2, 1). Suppose instead that the observer stepped 
back one unit along the a axis. What would be the apparent coordinates of P with respect to the 
observer? 

SOLUTION 

The problem can be set up as a transformation of coordinate systems. If we translate the origin O in the 
direction v = -I (to a new position at O') the coordinates of P in this system can be found by the translation 
7V 


/’ o 

T v -P= 0 1 
\0 0 



So the new coordinates are (2, 1 ) . This has the following interpretation: a displacement of one unit in a given 
direction can be achieved by either moving the object forward or stepping back from it. 


4.14 An object is defined with respect to a coordinate system whose units are measured in feet. If an 
observer’s coordinate system uses inches as the basic unit, what is the coordinate transformation 
used to describe object coordinates in the observer’s coordinate system? 
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SOLUTION 

Since there are 1 2 inches to a foot, the required transfonnation can be described by a coordinate scaling 
transformation with s = -h or 


and so 


XM'o «) 

M;M» .")GME) 


4.15 Find the equation of the circle (a 7 ) 2 + ( i / )‘ = 1 in terms of xy coordinates, assuming that the xV 
coordinate system results from a scaling of a units in the x direction and b units in the y direction. 

SOLUTION 

From the equations for a coordinate scaling transformation, we find 

,1 ,1 

x =-x y = 7 .v 
a o 


Substituting, we have 



Notice that as a result of scaling, the equation of the circle is transformed to the equation of an ellipse in the xy 
coordinate system. 


4.16 Find the equation of the line / = nix' + b in xy coordinates if the x'y' coordinate system results 
from a 90° rotation of the xy coordinate system. 

SOLUTION 

The rotation coordinate transfonnation equations can be written as 

x = xcos(90) +ysin(90 c ') = y y = — xsin(90) + ycos(90°) = —x 
Substituting, we find — x = my -F b. Solving for y, we have y = (— 1 /m)x — b/m. 


4.17 Find the instance transformation which places a half-size copy of the square /1(0, 0), B{ 1 , 0), 
C(1, 1), D(0, 1) [Fig. 4- 1 7(a)] into a master picture coordinate system so that the center of the 
square is at (-1,-1) [Fig. 4-17(6)]. 

SOLUTION 

The center of the square ABCD is at P {\ , ]). We shall first apply a scaling transfonnation while keeping P 
fixed (see Prob. 4.7). Then w'e shall apply a translation that moves the center P to P\— 1,— 1). Taking 
t x = (—1) — (!) = —] and similarly t = — | (so v = — |l — | J), we obtain 


N ■ 

J picture, square 


= T V S 


1 / 2 . 1 / 2 ,P ~ 


/I 0 -I 


0 1 

Vo o 






86 


TWO-DIMENSIONAL TRANSFORMATIONS 


[CHAP. 4 


■^square > 


■^picture - 


D 


Q 


l 

•p 

- - 1 1 1 


A 

1 ,B *«q,me -2 "« 

Q - 

P' 

■Pplcli 



-2 

— 


(a) (6) 


Fig. 4-17 

4.18 Write the composite transformation that creates the design in Fig. 4-19 from the symbols in Fig. 
4-18. 


SOLUTION 


•ftriantW 




■^triangle 


( 0 ) 



Fig. 4-18 


First we create an instance of the triangle [Fig. 4-18(q)] in the square [Fig. 4-18(6)]. Since the base of the 
triangle must be halved while keeping the height fixed at one unit, the appropriate instance transformation is 

^square, triangle = ^1/2,1 ' ‘^1/2,1 • 

The instance transformation needed to place the square at the desired position in the picture coordinate 
system (Fig. 4-19) is a translation in the direction v = 1 + J: 

N — T 

iy picture, square J v 

Then the composite transformation for placing the triangle into the picture is 

^picture, triangle ^picture, square ' ^square, triangle 



Fig. 4-19 
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and the composite transformation to place the square into the picture is 

r, = N ■ 

^picture, square picture, square 


Supplementary Problems 

4.19 What is the relationship between the rotations R s , R_ d , and Rj f 1 ? 

4.20 Describe the transformations used in magnification and reduction with respect to the origin. Find the new 
coordinates of the triangle A( 0, 0), 3(1. 1), C(5, 2) after it has been (a) magnified to twice its size and (b) 
reduced to half its size. 


4.21 Show that reflection about the Line v = x is attained by reversing coordinates. That is, 

M L (x,y) = <y,x) 


4.22 Show that the order in which transformations are performed is important by the transformation of triangle 
A( 1 , 0), 5(0, 1 ), C( 1 . 1 ), by (a) rotating 45° about the origin and then translating in the direction of vector I, 
and (b) translating and then rotating. 


4.23 An object point P(x, vj is translated in the direction v = al + bJ and simultaneously an observer moves in the 
direction v. Show that there is no apparent motion (from the point of view of the observer) of the object point. 

4.24 Assuming that we have a mathematical equation defining a curve in x'y' coordinates, and the TV 
coordinate system is the result of a coordinate transformation from the xv coordinate system, write 
the equation in terms of xv coordinates. 


4.25 Show that 


7V • T v = T v • TV = 7V 


v.+v. 


4.26 Show that S a b ■ S cd = S c<l ■ S a h = S acJ>tl . 


4.27 Show that ■ R js = R ti ■ R x = R„ + p. 

4.28 Find the condition under which we have 




• Ra — Ra • S' 


4.29 Is a simultaneous shearing the same as a shearing in one direction followed by a shearing in another direction? 
Why? 


4.30 Find the condition under which we can switch the order of a rotation and a simultaneous shearing and still get 
the same result. 
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4.31 Express a simultaneous shearing in tenns of rotation and scaling transformations. 

4.32 Express R f) in terms of shearing and scaling transfonnations. 

4.33 Express R e in tenns of shearing transfonnations. 

4.34 Prove that the 2D composite transfonnation matrix is always in the following fonn: 

a b c 
d e f 
0 0 1 

4.35 Consider a line from P, to P 2 and an arbitrary point P on the line. Prove that for any given composite 
transformation the transformed P is on the line between the transformations of P t and P 2 . 



Two-Dimensional 
Viewing and Clipping 


Much like what we see in real life through a small window on the wall or the viewfinder of a camera, a 
computer-generated image often depicts a partial view of a large scene. Objects are placed into the scene 
by modeling transformations to a master coordinate system, commonly referred to as the world coordinate 
system (WCS). A rectangular window with its edges parallel to the axes of the WCS is used to select the 
portion of the scene for which an image is to be generated (see Fig. 5-1). Sometimes an additional 
coordinate system called die viewing coordinate system is introduced to simulate the effect of moving 
and/or tilting the camera. 

On the other hand, an image representing a view often becomes part of a larger image, like a photo on 
an album page, which models a computer monitor’s display area. Since album pages vary and monitor 
sizes differ from one system to another, we want to introduce a device-independent tool to describe the 
display area. This tool is called the normalized device coordinate system (NDCS) in which a unit (1x1) 
square whose lower left corner is at the origin of the coordinate system defines the display area of a virtual 
display device. A rectangular viewport with its edges parallel to the axes of the NDCS is used to specify a 
sub-region of the display area that embodies the image. 


Viewing 

coordinate 

system 

A 



World coordinate system 


Workstation 


a Workstation 1 

viewport 

window 



Viewport / 

! / 


/\f\\ 

o 

i LAI 



^ Normalized device 

coordinate system 

Fig. 5-1 Viewing transformation. 


Device/image 
coordinate system 
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The process that converts object coordinates in W'CS to normalized device coordinates is called 
window-to-viewport mapping or normalization transformation, which is the subject of Sect. 5.1. The 
process that maps normalized device coordinates to discrete device/image coordinates is called work- 
station transformation, which is essentially a second window-to-viewport mapping, with a workstation 
window in the normalized device coordinate system and a workstation viewport in the device coordinate 
system. Collectively, these two coordinate-mapping operations are referred to as viewing transformation. 

Workstation transformation is dependent on the resolution of the display device/ frame buffer. When 
the whole display area of the virtual device is mapped to a physical device that does not have a 1/1 aspect 
ratio, it may be mapped to a square sub-region (see Fig. 5-1) so as to avoid introducing unwanted 
geometric distortion. 

Along with the convenience and flexibility of using a w'indow to specify a localized view comes the 
need for dipping, since objects in the scene may be completely inside the w'indow, completely outside the 
w'indow, or partially visible through the window (e.g. the mountain-like polygon in Fig. 5-1). The clipping 
operation eliminates objects or portions of objects that are not visible through the window to ensure the 
proper construction of the corresponding image. 

Note that clipping may occur in the world coordinate or viewing coordinate space, w'here the window 
is used to clip the objects; it may also occur in the normalized device coordinate space, where the 
viewport/ workstation window is used to clip. In either case we refer to the window or the view- 
port/w'orkstation window as the dipping window. We discuss point clipping, line clipping, and polygon 
clipping in Secs. 5.2, 5.3, and 5.4, respectively. 


5.1 WINDOW-TO-VIEWPORT MAPPING 

A window' is specified by four werld coordinates: vv.v mm , w.v max , w> mjn , and w;y max 


Similarly, a viewport is described by four normalized device coordinates: r.v mjn , t?x max , i;y min 


(see Fig. 5-2). 
, and tV’max* 

The objective of window-to-viewport mapping is to convert the world coordinates (wx, vyy) of an arbitrary 
point to its corresponding normalized device coordinates (vx, vy). In order to maintain the same relative 
placement of the point in the viewport as in the window, we require: 


wx — M’X n 


vx - vx„ 


and 




- ww. 


Thus 


vx ■ 


(wx -wx m J + vx„ 


vy 


(Wy - VVVjnjn) + UV r 


Since the eight coordinate values that define the window and the viewport are just constants, we can 
express these two formulas for computing (vx, vv) from (wx, vyy) in terms of a translate-scale-translate 
transformation N 


where 


N = 



= N- 


1JX — I'X 




0 o\ 

Oinax LVmin 


vyy r 


0 


0 


\) 



-WX,, 

-WV n 

1 
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Fig. 5-2 Window-to-viewport mapping. 

Note that geometric distortions occur (e.g. squares in the window become rectangles in the viewport) 
whenever the two scaling constants differ. 


5.2 POINT CLIPPING 

Point clipping is essentially the evaluation of the following inequalities: 

■'mm — — *max Tmin — — .V'max 

where x min , x max , y nim and y max defttte tire clipping window. A point (.v, y) is considered inside the window 
when the inequalities all evaluate to true. 


5.3 LINE CLIPPING 

Lines that do not intersect the clipping window are either completely inside the window or completely 
outside the window. On the other hand, a line that intersects the clipping window is divided by the 
intersection point(s) into segments that are either inside or outside the window. The following algorithms 
provide efficient w'ays to decide the spatial relationship between an arbitrary line and the clipping window 
and to find intersection point(s). 


The Cohen-Sutherland Algorithm 

In this algorithm we divide the line clipping process into tw'o phases: ( 1 ) identify those lines which 
intersect the clipping w'indow' and so need to be clipped and (2) perform the clipping. 

All lines fall into one of the following clipping categories'. 

1 . Visible both endpoints of the line lie within the window. 

2. Not visible — the line definitely lies outside the window. This will occur if the line from (jr, , Vj ) to 
(xy.Vi) satisfies any one of die following four inequalities: 

.Yl,X 2 >X max yi,y 2 > Vmax 

-' 1 ■ -'l < ~ -'mm Tl ’ .' ? 2 < ~ .'mill 

3. Clipping candidate — the line is in neither category 1 nor 2. 

In Fig. 5-3, line AB is in category I (visible); lines CD and EF are in category 2 (not visible); and lines 
GH. IJ, and KL are in category 3 (clipping candidate). 
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■*min 


Fig. 5-3 

The algorithm employs an efficient procedure for finding the category of a line. It proceeds in two 
steps: 

1 . Assign a 4-bit region code to each endpoint of the hue. The code is determined according to which 
of the following nine regions of the plane the endpoint lies in 


1001 | 1000 | 1010 



oioi ; oioo ; oiio 


■'rain Anax 

Starting from the leftmost bit, each bit of the code is set to true ( 1 ) or false (0) according to the 
scheme 

Bit 1 = endpoint is above the window = sign ( y — y ntax ) 

Bit 2 s= endpoint is below the window = sign ( _v min - >') 

Bit 3 = endpoint is to the right of the window = sign (x - x max ) 

Bit 4 = endpoint is to the left of the window — sign (x min - x) 

We use the convention that sign(n) = 1 if a is positive, 0 otherwise. Of course, a point with code 
0000 is inside the window. 

2. The line is visible if both region codes are 0000, and not visible if the bitwise logical AND of the 
codes is not 0000, and a candidate for clipping if the bitwise logical AND of the region codes is 
0000 (see Prob. 5.8). 

For a line in category 3 we proceed to find the intersection point of the line with one of the boundaries 
of the clipping window, or to be exact, with the infinite extension of one of the boundaries (see Fig. 5-4). 

We choose an endpoint of the line, say (x, , Vj ), that is outside the window, i.e., whose region code is not 
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Fig. 5-4 


0000. We then select an extended boundary line by observing that those boundary lines that are candidates 
for intersection are the ones for which the chosen endpoint must be “pushed across” so as to change a “ 1” 
in its code to a “0” (see Fig. 5-4). This means: 

If bit 1 is 1, intersect with line y =y max . 

If bit 2 is 1, intersect with line y = _v mm . 

If bit 3 is 1, intersect with line x = x max . 

If bit 4 is 1, intersect with line x — x min . 

Consider line CD in Fig. 5-4. If endpoint C is chosen, then the bottom boundary line y = y mjn is 
selected for computing intersection. On the other hand, if endpoint D is chosen, then either the top 
boundary line y = y max or the right boundary line x = x max is used. The coordinates of the intersection 
point are 

Xj — x min or x max if the boundary line is vertical 

= v, + m(x, - x, ) 

or 

( Xj — Xj + (v,- — y\)/m if the boundary line is horizontal 

TV Train ®r 3 max 

where m = (y 2 — V\)/(x 2 — X] ) is the slope of the line. 

Now we replace endpoint (x, , jt ) with the intersection point (x,-,.y f ), effectively eliminating the portion 
of the original line that is on the outside of the selected window boundary. The new endpoint is then 
assigned an updated region code and the clipped line re-categorized and handled in the same way. This 
iterative process terminates when we finally reach a clipped line that belongs to either category 1 (visible) 
or category 2 (not visible). 


Midpoint Subdivision 

An alternative way to process a line in category 3 is based on binary search. The line is divided at its 
midpoint into two shorter line segments. The clipping categories of the two new' line segments are then 
determined by their region codes. Each segment in category 3 is divided again into shorter segments and 
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categorized. This bisection and categorization process continues until each line segment that spans across a 
window boundary (hence encompasses an intersection point) reaches a threshold for line size and all other 
segments are either in category 1 (visible) or in categoiy 2 (invisible). The midpoint coordinates (x m , y m ) of 
a line joining (xy.Vj) and (x 2 , v 2 ) am given by 

*i+*2 Vi +y 2 

The example in Fig. 5-5 illustrates how midpoint subdivision is used to zoom in onto the two 
intersection points /, and I 2 with 10 bisections. The process continues until we reach two line segments that 
are, say, pixel-sized, i.e., mapped to one single pixel each in the image space. If the maximum number of 
pixels in a line is M, this method will yield a pixel-sized line segment in N subdivisions, where 2 A = M or 
N — log, M . For instance, when M = 1024 we need at most N — log, 1024 = 10 subdivisions. 



The Liang-Barsky Algorithm 

The following parametric equations represent a line from (X[ , y { ) to (x 2 ,y 2 ) along with its infinite 
extension: 


x — Xj + Ax ■ u 
y=yi + Ay • u 


where Ax = x 2 — x, and Ay = y 2 — >’i . The line itself corresponds to 0 < u < 1 (see Fig. 5-6). Notice that 
when we traverse along the extended fine with u increasing from — oc to -poo, we first move from the 
outside to the inside of the clipping window’s two boundary lines (bottom and left), and then move from 
the inside to the outside of the other two boundary lines (top and right). If we use and u 2 , where /q < u 2 , 
to represent the beginning and end of the visible portion of the line, we have //, = maximum(0, u h u h ) and 
u 2 = minimum) l , u,, u r ), where u h u b , u,, and u r correspond to the intersection point of the extended line 
with the window’s left, bottom, top, and right boundary, respectively. 
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Now consider the tools we need to turn this basic idea into an efficient algorithm. For point (x, y) 
inside the clipping window, we have 




Y ■ 

A min 

< Xj + Av ■ 

U < 

Xmax 



.'min 

< y, + Ay • 

U < 

.'max 

Rewrite the four inequalities 

as 







Pk u 5 

. q k* k 

= 1, 

2,3,4 

where 







P\ = 

= —Ax 

«l = X| - 

■^min 

(left) 


Pi = 

= Ax 

7 2 -'max 

-^1 

(right) 


Pi = 

- -Ay 

<h =y\ - 

Aniin 

( bottom) 


Pa = 

- Ay 

9a .' ; max 

-yi 

(top) 


Observe the following facts: 

• if p k — 0, the line is parallel to the corresponding boundary and 

if q k < 0, the line is completely outside the boundary and can be eliminated 

if q k > 0, the line is inside the boundary and needs further consideration, 

• if p k < 0, the extended line proceeds from the outside to the inside of the corresponding boundary 
line, 

• if p k > 0, the extended line proceeds from Hie inside to the outside of the corresponding boundary 
line, 

• when p k 7^ 0, the value of u that corresponds to the intersection point is q k / p k . 

The Liang Barsky algorithm for finding the visible portion of the line, if any, can be stated as a four-step 
process: 

1 . [ f p k = () and c/ k < 0 for any k, eliminate the line and stop. Otherwise proceed to the next step. 

2. For all k such that p k < 0, calculate r k — q k jp k . Let u t be the maximum of the set containing 0 
and the calculated r values. 

3. For all k such that p k > 0, calculate r k — q k /p k . Let u 2 be the minimum of the set containing 1 
and the calculated r values. 

4. If i< i > u 2 , eliminate tire line since it is completely outside the clipping window. Otherwise, use », 
and u 2 to calculate the endpoints of the clipped line. 
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5.4 POLYGON CLIPPING 

In this section we consider the case of using a polygonal clipping window' to clip a polygon. 


Convex Polygonal Clipping Windows 

A polygon is called convex if the line joining any tw'O interior points of the polygon lies completely 
inside the polygon (see Fig. 5-7). A non-convex polygon is said to be concave. 




Concave 

polygon 


Fig. 5-7 


By convention, a polygon w'ith vertices P t P N (and edges P, _ , P, and P^P\) is said to be 

positively oriented if a tour of the vertices in the given order produces a counterclockwise circuit. 

Equivalently, the left hand of a person standing along any directed edge P i _ i P, or P,\P\ would be 
pointing inside the polygon [see orientations in Figs. 5-8 (<j) and 5-8(6)]. 



(o) Positive orientation. ( b ) Negative orientation. 


Fig. 5-8 

Let A(xy,yi) and B(x 2 ,y 2 ) be the endpoints of a directed line segment. A point P(x,v) will be to the 
left of the line segment if the expression C = (x 2 — X])(v — Vj ) — (y 2 — Vj )(x — x,) is positive (see Prob. 
5.13). We say that the point is to the right of the line segment if tins quantity is negative. If a point P is to 
the right of any one edge of a positively oriented, convex polygon, it is outside the polygon. If it is to the 
left of every edge of the polygon, it is inside the polygon. 

This observation forms the basis for clipping any polygon, convex or concave, against a convex 
polygonal clipping window. 


The Sutherland-Hodgman Algorithm 

Let P x , .. ., P N be the vertex list of the polygon to be clipped. Let edge E, determined by endpoints A 
and B, be any edge of the positively oriented, convex clipping polygon. We clip each edge of the polygon in 
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turn against the edge E of the clipping polygon, forming a new polygon whose vertices are determined as 
follows. 

Consider the edge P i _ 1 P i : 

1. Ifboth Pj | and P, are to the left of the edge, vertex P, is placed on the vertex output list of the 
clipped polygon [Fig. 5-9(«)]. 

2. Ifboth P, ! and P, are to the right of the edge, nothing is placed on the vertex output list [Fig. 
5-9(6)]. 

3. If P, [ is to the left and P, is to the right of the edge E, the intersection point l of line segment 
P, i P, with the extended edge E is calculated and placed on the vertex output list [Fig. 5-9(c)]. 

4. If P, [ is to the right and P, is to the left of edge E, the intersection point I of the line segment 
Pj \Pj with the extended edge E is calculated. Both / and P, are placed on the vertex output list 
[Fig. 5-9 (</)]. 

The algorithm proceeds in stages by passing each clipped polygon to the next edge of the window and 
clipping. See Probs. 5.14 and 5.15. 


E 


E 


E 

I 


E 

I 





Special attention is necessary in using the Sutherland Hodgman algorithm in order to avoid unwanted 
effects. Consider the example in Fig. 5- 10(a). The correct result should consist of two disconnected parts, a 
square in the lower left comer of the clipping window and a triangle at the top [see Fig. 5-10(6)]. However, 
the algorithm produces a list of vertices (see Prob. 5.16) that fonns a figure with the two parts connected by 
extra edges [see Fig. 5- 10(c)], The fact that these edges are drawn twice in opposite direction can be used 
to devise a post-processing step to eliminate them. 


The VVeiler-Athcrton Algorithm 

Let the clipping window be initially called the clip polygon, and the polygon to be clipped the subject 
polygon [see Fig. 5-1 1(a)]. We start with an arbitrary vertex of the subject polygon and trace around its 
border in the clockwise direction until an intersection with the clip polygon is encountered: 

• If the edge enters the clip polygon, record the intersection point and continue to trace the subject 

polygon. 



98 


TWO-DIMENSIONAL VIEWING AND CLIPPING 


[CHAP. 5 




Fig. 5-10 


Extra 



(c) 


• If the edge leaves the clip polygon, record the intersection point and make a right turn to follow 
the clip polygon in the same manner (i.e., treat the clip polygon as subject polygon and the subject 
polygon as clip polygon and proceed as before). 

Whenever our path of traversal forms a sub-polygon we output the sub-polygon as part of the overall result. 
We then continue to trace the rest of the original subject polygon from a recorded intersection point that 
marks the beginning of a not-yet-traced edge or portion of an edge. Tire algorithm terminates when the 
entire border of the original subject polygon has been traced exactly once. 


Subject polygon 




(a) 


(b) 


Fig. 5-11 


For example, the numbers in Fig. 5-1 1(a) indicate the order in which the edges and portions of edges 
are traced. We begin at the starting vertex and continue along the same edge (from 1 to 2) of the subject 
polygon as it enters the clip polygon. As we move along the edge that is leaving the clip polygon we make 
a right turn (from 4 to 5) onto the clip polygon, which is now considered the subject polygon. Following 
the same logic leads to the next right turn (from 5 to 6) onto the current clip polygon, which is really the 
original subject polygon. With the next step done (from 7 to 8) in the same way we have a sub-polygon for 
output [see Fig. 5-1 1(6)]. We then resume our traversal of the original subject polygon from the recorded 
intersection point where we first changed our course. Going from 9 to 10 to 11 produces no output. After 
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skipping the already-traversed 6 and 7, we continue with 12 and 13 and come to an end. The figure in Fig. 
5-1 1(6) is the final result. 

Applying the Weiler- Atherton algorithm to clip the polygon in Fig. 5-10(«) produces correct result 
[see Fig. 5-12(a) and (6)]. 




(a) (*>) 


Fig. 5-12 


5.5 EXAMPLE: A 2D GRAPHICS PIPELINE 

Shared by many graphics systems is the overall system architecture called the graphics pipeline. The 
operational organization of a 2D graphics pipeline is shown in Fig. 5-13. Although 2D graphics is typically 
treated as a special case (z = 0) of three-dimensional graphics, it demonstrates the common working 
principle and basic application of these pipelined systems. 



Display 
f frame buffer) 


Trans format ions Window & viewport Color attributes 

settings 


Fig. 5-13 A 2D graphics pipeline. 


At the beginning of the pipeline we have object data (e.g., vertex coordinates for lines and polygons 
that make up individual objects) stored in application-specific data structures. A graphics application uses 
system subroutines to initialize and to change, among other things, the transformations that are to be 
applied to the original data, window and viewport settings, and the color attributes of the objects. 
Whenever a drawing subroutine is called to render a pre-defined object, the graphics system first applies 
the specified modeling transformation to the original data, then carries out viewing transformation using 
the current window and viewport settings, and finally performs scan conversion to set the proper pixels in 
the frame buffer with the specified color attributes. 

Suppose that we have an object centered in its own coordinate system [see Fig. 5-14(a)], and w'e are to 
construct a sequence of images that animates the object rotating around its center and moving along a 
circular path in a square display area [see Fig. 5-14(6)]. We generate each image as follows: first rotate the 
object around its center by angle a, then translate the rotated object by offset • I to position its center on the 
positive x axis of the WCS, and rotate it with respect to the origin of the WCS by angle fi. We control the 
amount of the first rotation from one image to the next by Aa, and that of the second rotation by A[i. 
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X 


(a) 



Fig. 5-14 


window! -winsize/2, winsize/2, -winsize/2, winsize/2); 

« = 0 ; 

while ( 1 ) { 

setColor(background); 

clear(); 

setColor( color); 

pushCTMO; 

translate(offset, 0); 
rotate! a); 
drawObjectf); 
popCTM( ); 
a = a + At. 
rotate! A/?); 

} 

We first set the window of winsize by winsize to be sufficiently large and centered at the origin of the 
WCS to cover the entire scene. The system’s default viewport coincides with the unit display area in the 
NDC'S. The default workstation window is the same as the viewport and the default workstation viewport 
corresponds to the whole square display area. 

The graphics system maintains a stack of composite transformation matrices. The CTM on top of the 
stack, called the current CTM, is initially an identity matrix and is automatically used in modeling 
transformation. Each call to translate, scale, and rotate causes the system to generate a corresponding 
transformation matrix and to reset the current CTM to take into account the generated matrix. The order of 
transformation is maintained in such a way that the most recently specified transformation is applied first. 
When pushCTMO is called, the system makes a copy of the current CTM and pushes it onto the stack (now 
we have two copies of the current CTM on the stack). When popCTMf) is called, the system simply 
removes the CTM on top of the stack (now we have restored the CTM that was second to the removed 
CTM to be the current CTM). 


Panning and Zooming 

Two simple camera effects can be achieved by changing the position or size of the window. When the 
position of the window is, for example, moved to the left, an object in the scene that is visible through the 
window would appear moved to the right, much like what we see in the viewfinder when we move or pan a 
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camera. On the other hand, if we fix the window on an object but reduce or increase its size, the object 
would appear bigger (zoom in) or smaller (zoom out), respectively. 


Double Buffering 

Producing an animation sequence by clearing the display screen and constructing the next frame of 
image often leads to flicker, since an image is erased almost as soon as it is completed. An effective 
solution to this problem is to have two frame buffers: one holds the image on display while the system 
draws a new image into the other. Once the new image is drawn, a call that looks like swapBuffer() would 
cause the two buffers to switch their roles. 


Lookup Table Animation 

We can sometimes animate a displayed image in the lookup table representation by changing or 
cycling the color values in the lookup table. For example, we may draw the monochromatic object in Fig. 
5- 14(r/) into the frame buffer in several pre-detennined locations, using consecutive lookup table entries for 
the color attribute in each location (see Fig. 5-15). We initialize lookup table entry 0 with the color of the 
object, and all other entries with the background color. This means that in the beginning the object is 
visible only in its first position (labeled 0). Now if we simply reset entry 0 with the background color and 
entry 1 with the object color, we would have the object “moved” to its second position (labeled 1) without 
redrawing the image. The object’s circular motion could hence be produced by cycling the object color 
through all relevant lookup table entries. 



Fig. 5-15 





102 


TWO-DIMENSIONAL VIEWING AND CLIPPING 


[CHAP. 5 


Solved Problems 


5.1 Let 


t;X — l)X ■ 
^max ^min 

"■■'max — " r n'in 


and 


Omax Omin 
V*T max - M-V m|n 


Express window-to-viewport mapping in the fonn of a composite transformation matrix. 

SOLUTION 


I 0 rx m i n 
N = I 0 1 vy mm 


0 0 


0 s y -S y wy m]n + vy mi 

0 0 



-SvWX m „. + vx. 


I 0 “Trim ^ 

0 1 vrv mln 

\o 0 1 / 


5.2 Find the normalization tTansfonnation that maps a window whose lower left corner is at (1, 1 ) and 
upper right comer is at (3, 5) onto (a) a viewport that is the entire normalized device screen and ( b ) 
a viewport that has lower left comer at (0,0) and upper right comer (|, j). 

SOLUTION 

From Prob. 5.1, we need only identify the appropriate parameters. 

(a) The window parameters are irx mm — 1, \vx max = 3, ur min = I, and vn' max = 5. The viewport parameters 
are “min = “max = 1 . “mm = 0, and ty raax = 1 . Then s t = 1, s y = and 

i\ ° -i 

N=\0 I -i 

Vo 0 1 

( b ) The window parameters are the same as in (a). The viewport parameters are now rx min = 0, rx nlax = ± 
“mm = 0. “max = ?• Then sy = s y = £, and 

(\ o -i 

N=\° 5 

Vo 0 1 

5.3 Find the complete view'ing transfonnation tliat maps a window in world coordinates with x extent 1 
to 1 0 and y extent 1 to 1 0 onto a viewport with x extent 5 to | and y extent 0 to £ in nonnahzed 
device space, and then maps a workstation window with x extent 5 to j and y extent 5 to 5 in the 
nonnahzed device space into a workstation viewport with x extent 1 to 1 0 and y extent 1 to 1 0 on 
the physical display device. 

SOLUTION 

From Prob. 5.1, the parameters for the normalization transfonnation are wx mm = 1, H!i mas = 10, 
“Train = 1. “Tmax = 10, and vx mm = 1 vx ma = |, t;y, mn = 0, and rv inax = \. Then 

_ 1/2 _ 1 1/2 1 

_ ~9~ _ 78 ^-”9 18 



CHAP. 5] 


TWO-DIMENSIONAL VIEWING AND CLIPPING 


103 


and 


N = 



0 

X 

18 

0 



The parameters for the workstation transformation are yix.„,„ = | . «x max = ^ j . wv max = j , 
and rx lnin = 1, rx raax = 10, = I, and t>v imx = 10. Then 




and 


/ 36 0 —8 \ 

W = I 0 36 -8 

V 0 0 1/ 

The complete viewing transformation V is 


1 

/36 

0 

-8\ 

(Vs 

0 

Te\ 


( 2 

0 

— 1 \ 

II 

ii 

0 

36 

-8 

0 

1 

18 

1 

18 

= 

0 

2 

-10 

i 

V 0 

0 

\) 

V 0 

0 

\) 


vo 

0 

1/ 


5.4 Find a normalization transformation from the window whose lower left comer is at (0, 0) and upper 
right comer is at (4, 3) onto the normalized device screen so that aspect ratios are preserved. 

SOLUTION 


The window aspect ratio is = |. Unless otherwise indicated, we shall choose a viewport that is as large 
as possible with respect to the normalized device screen. To this end, we choose the x extent from 0 to I and 
the v extent from 0 to So 

1 4 

° l = 3/4 = 3 


As in Prob. 5.2, with parameters wa inin = 0, = 4, wy min = 0, = 3 and vx min = 0, vx^^. = 1, 

^Vmin *3’max 4 > 


( 


1 

4 


N = 


0 

Vo 


0 o\ 

1 0 

0 1/ 


5.5 Find the normalization transformation N which uses the rectangle A( \, 1), 5(5, 3), C(4. 5), /)((), 3) 
as a window [Fig. 5-16(«)] and the nonnalized device screen as a viewport [Fig. 5-16(5)]. 

SOLUTION 

We will first rotate the rectangle about A so that it is aligned with the coordinate axes. Next, as in Prob. 
5.1, we calculate s x and s v and finally we compose the rotation and the transformation N (from Prob. 5.1) to 
find the required nonnalization transfonnation N R . 

The slope of the line segment AB is 

3-1 1 
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(0, 0) l 


(6) Viewport. 


Fig. 5-16 


Looking at Fig. 5-11, we see that —6 will be the direction of the rotation. The angle 0 is determined from the 
slope of a line (App. 1) by the equation tand = 4. Then 


sin (9 : 


1 

7 !’ 


and so 


sin(— 0) = 

75 


cos 6 = -^=, 

75 


cos (—0) = — = 

75 


The rotation matrix about T(l, 1) is then (Chap. 4, Prob. 4.4): 


r -8,a — 


the distance formula (App. 1) to calculate these lengths yields 

d(A, B) = 72 2 + 4 2 = 720 = 275 d(A. D) = 7l 2 + 2 2 = 75 
Also, the x extent of the normalized device screen is 1 , as is the y extent. Calculating s x and s v . 


/ 2 

1 

\ 3\\ 

75 

75 

v TsJ 

1 

2 

\ 1 \ 

~7I 

75 

K 75/ 

V o 

0 

1 / 

ie length of AB. Similarly, the 


viewport jc extent 1 
window x extent 2 75 


viewport v extent 1 

“ 7 ! 


window v extent 


So 


N = 


/_L 

275 

0 

V o 


0 - 

1 

7 ! 

0 


275 

1 

71 

i ) 


The normalization transformation is then 


n r = n-r_ 9iA = 


/ i J. _J_\ 

I 5 to 10 » 


\ 0 0 


5.6 Let R be the rectangular window whose lower left-hand corner is at L(— 3, 1) and upper right-hand 
comer is at R( 2, 6). Find the region codes for the endpoints in Fig. 5-17. 
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I B(- 1,7) 


I 


D{ 3, 8) 




{ C(-l, 5) 

l ' (0000) 
£(- 2, 3) 


A (-4, 2) 


M-3, 1) 


*«nln = 3 


0 ( 1 , - 2 ) | 


Fig. 5-17 

SOLUTION 

The region code for point (x, y) is set according to the scheme 

Bit I = sign(v - v max ) = signfv - 6) Bit 3 = sign(x - x m .,J = sign(x - 2) 

Bit 2 = sign(y mm - v) = sign(l - y) Bit 4 = sign(x mm - x) = sign(-3 - x) 


. f I if a is positive 
Slgn(a)= i() otherwise 


4-4.2) -► 0001 

F( 1 .2) — C 

5(- 1, 7) -y 1000 

00 ,- 2 )- 

C(— 1, 5) -y 0000 

7/(3, 3) -* ( 

D( 3,8)— ► 1010 

f(— 4, 7) — > 

E(-2, 3) -* 0000 

4-2, 10) - 


* 0100 
0010 
1001 
-> 1000 






106 


TWO-DIMENSIONAL VIEWING AND CLIPPING 


[CHAP. 5 


5.7 Clipping against rectangular windows whose sides are aligned with the x and y axes involves 
computing intersections with vertical and horizontal lines. Find the intersection of a line segment 
P\P 2 [joining P^x, to P 2 (- r 2 >>’ 2 )] with ( a ) the vertical line x = a and ( b ) the horizontal line 
y — b. 

SOLUTION 

We write the equation of P\P 2 in parametric form (App. 1, Prob. A1.23): 

jx = x, +/(x 2 -x,) 0</<1 (51) 

1 v = Vi+t(y 2 -Vi). ~ - (5.2) 

( a ) Sincex = a, we substitute this into equation (5.1) and find t = (a — x,)/(x 2 — x,). Then, substituting this 
value into equation (5.2), we find that the intersection point is x t = a and 

- v ' => '' + (S) (V2_ - v ' ) 

(/)) Substituting y = b into equation (5.2), we find t = (b — y, )/(y 2 — y, ). When this is placed into equation 
(5.1), the intersection point is >7 = h and 



5.8 Find the clipping categories for the line segments in Prob. 5.6 (see Fig. 5-17). 

SOLUTION 

We place the line segments in their appropriate categories by testing the region codes found in Prob. 5.6. 

Category 7 (visible): EF since the region code for both endpoints is 0000. 

Category 2 (not visible): IJ since (1001) AND (1000) = 1000 (which is not 0000). 

Category 3 (candidates for clipping): AB since (0001) AND ( 1000) = 0000, CD since (0000) AND 
(1010) = 0000, and GH since (0100) AND (0010) = 0000. 

5.9 Use the Cohen Sutherland algorithm to clip the line segments in Prob. 5.6 (see Fig. 5-17). 

SOLUTION 

From Prob. 5.8, the candidates for clipping are AB , CD , and GH. 

In clipping AB, the code for A is 000 1 . To push the 1 to 0, we clip against the boundary line x mill = — 3. 
The resulting intersection point is /, (—3. 3 j). We clip (do not display) /!/, and work on 7,7?. The code for 7, is 
0000. The clipping category for /[ B is 3 since (0000) AND (1000) is (0000). Now B is outside the window 
(i.e., its code is 1000), so we push the 1 to a 0 by clipping against the line y max = 6. The resulting intersection 
is 7 2 (— 1 1 , 6). Thus 1 2 B is clipped. The code for 7, is 0000. The remaining segment 7,7 2 is displayed since both 
endpoints lie in the window (i.e., their codes are 0000). 

For clipping CD, we start with D since it is outside the window. Its code is 1010. We push the first 1 to a 0 
by clipping against the line y max = 6. The resulting intersection 7, is (5 , 6) and its code is 0000. Thus 7,D is 
clipped and the remaining segment C7 3 has both endpoints coded 0000 and so it is displayed. 

For clipping GH, we can start with either G or 77 since both are outside the window. The code for G is 
0100, and we push the 1 to a 0 by clipping against the line y mi „ = 1. The resulting intersection point is 
7 4 (2j, 1), and its code is 0010. We clip G7 4 and work on 7 4 77. Segment 7 4 77 is not displayed since (0010) 
AND (0010) = 0010. 

5.10 Clip line segment CD of Prob. 5.6 by using the midpoint subdivision process. 

SOLUTION 

The midpoint subdivision process is based on repeated bisections. To avoid continuing indefinitely, we 
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agree to say that a point (x, , y, ) lies on any of the boundary lines of the rectangle, say, boundary line x = x m . lx , 
for example, if — TOL < x, — x, mx < TOL. Here TOL is a prescribed tolerance, some small number, that is 
set before the process begins. 

To clip CD, we determine that it is in category 3. For this problem we arbitrarily choose TOL = 0.1. We 
find the midpoint of CD to be A/,(l. 6.5). Its code is 1000. 

So M X D is not displayed since (1 000 ) AND (1010) = 1000. We further subdivide CM ; since (0000) 
AND (1000) = 0000. The midpoint of CM , is A/ 2 (0, 5.75): the code for M 2 is 0000. Thus CM 2 is displayed 
since both endpoints are 0000 and M 2 M ] is a candidate for clipping. The midpoint of AT M } is A/ 3 (0.5, 6. 1 25), 
and its code is 1000. Thus M } M { is chipped and M 2 M i is subdivided. The midpoint of M 2 M 3 is 
M 4 (0. 25, 5.9375), whose code is 0000. However, since Vj = 5.9375 Mes within the tolerance 0.1 of the 
boundary line y max = 6 — that is, 6 — 5.9375 = 0.0625 < 0.1, we agree that M 4 lies on the boundary line 
v'max = 6- Thus M 2 M 4 is displayed and M 4 M 2 is not displayed. So the original line segment CD is clipped at 
M 4 and the process stops. 


5.1 1 Suppose that in an implementation of the Cohen Sutherland algorithm we choose boundary lines in 
the top bottom-right left order to clip a line in category 3, draw a picture to show a worst-case 
scenario, i.e., one that involves the highest number of iterations. 

SOLUTION 

See Fig. 5-18. 



5.12 Use the Liang-Barsky algorithm to clip the lines in Fig. 5-19. 



Fig. 5-19 
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SOLUTION 

For line AB , we have 


o 

II 

O 

II 

5 ? 

Pi = o 

ft=-2 

Pi = -4 

43 =4 

2= 

II 

4^ 

44 =2 


Since p 2 = 0 and q 2 < —2, AB is completely outside the right boundary. 
For line CD, we have 


Pt=0 4i = 2 

Pi = 0 <? 2 = 6 



Pi = -3 43 = 5 

ri = ~l 


Pa = 3 44 = 1 

r 4=5 

Thus u ] = max(0, — f) = 0 and 
(3, 7) and (3. 7 + 3(j)) = (3, 8 ). 

u 2 = min( 1 , f) = f. Since u 

! < u 2 , the two endpoints of the clipped line are 

For line EF, we have 

P\=-6 < 7 , = 1 

r — i 

r l - 6 


p 2 = 6 4 2 = 7 

r —2 

T 1 ~ 6 


Pi - “ 1 43 = 1 

f — -1 

G - -y 


Pa = 1 44 = 5 

r 4 = f 

Thus «| = max(0, — 4, — 1) = 0 and u 2 — min(l , l, 5) — 1 . 

Since it\ = 0 and t < 2 = 1, line £F is completely 

inside the clipping window. 



For line GH, we have 

P\ = “2 4i = 5 

s 

r l =~2 


Pi — 2 42 = 3 

r —2 

r l — 2 


Pi=~ 3 43 = 4 

r — 4 

G - 3 


P 4 = 3 44 = 2 

»• —2 
r 4— 3 

Thus w, = inax(0, — | — ±) = 0 and u 2 = min( 1 . § , f) = §• 
line are ( 6 , 6 ) and (6 + 2((:), 6 + 3(^)) = (7 4 , 8 ). 

Since < */ 2 , the two endpoints of the clipped 

For line IJ, we have 

Pi =-12 4i = —2 



p 2 = 12 ^2 = 10 

G=| 


P.3 = 6 43 = 5 

G=f 


P4 = - 6 44 = 1 

r 4 = -i 

Thus «] = max(0, f , —3) = | and u 2 = min( 1, | , |) = |. Since u x < u 2 , the two endpoints of the clipped line 

are (-1 + 12 ( 1 ). 7 + (-6)(f)) = 

= (1. 6 ) and (-1 + 12(f), 7 + (- 6 )(§)) = (9. 2). 


5.13 How can w r e determine whether a point P(x, y) lies to the left or to the right of a line segment 
joining the points H(x,, Vj) and 5(x 2 ,y 2 )‘? 

SOLUTION 

Refer to Fig. 5-20. Fonn the vectors AB and AP. If the point P is to the left of AB, then by the definition 
of the cross product of two vectors (App. 2) the vector AB x AP points in the direction of the vector K 
perpendicular to the xv plane (see Fig. 5-20). If it lies to the right, the cross product points in the direction 
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— K. Now 


So 


AB = (x 2 - xi )I + (v 2 - y x )J AP = (x - x, )1 4- (v - y x )J 


A B x A P = [(.v 2 - x, )(v - v, ) - (y 2 - v, )(x - x, )]K 
Then the direction of this cross product is determined by the number 

C = (x, - x, )(y - v, ) - (y 2 - y x )(x - X! ) 

If C is positive, P lies to the left of AB. If C is negative, then P lies to the right of AB. 



Fig. 5-20 


5.14 Draw a flowchart illustrating the logic of the Sutherland I lodgman algorithm. 

SOLUTION 

The algorithm inputs the vertices of a polygon one at a time. For each input vertex, either zero, one, or 
two output vertices will be generated depending on the relationship of the input vertices to the clipping edge 
E. 

We denote by P the input vertex, S the previous input vertex, and F the first arriving input vertex. The 
vertex or vertices to be output are detennined according to the logic illustrated in the flowchart in Fig. 5-21. 

Recall that a polygon with n vertices P x , P 2 , P H has n edges P X P 2 , ... , P„-\P n and the edge P„P\ closing 

the polygon. In order to avoid the need to duplicate the input of P x as the final input vertex (and a 
corresponding mechanism to duplicate the final output vertex to close the polygon), the closing logic shown in 
the flowchart in Fig. 5-22 is called after processing the final input vertex P„. 


5.15 Clip the polygon P x P 9 in Fig. 5-23 against the window ABCD using the Sutherland 

Hodgman algorithm. 

SOLUTION 

At each stage the new output polygon, whose vertices are determined by applying the Sutherland- 
Hodgman algorithm (Prob. 5. 14), is passed on to the next clipping edge of the window ABCD. The results are 
illustrated in Figs. 5-24 through 5-27. 


5.16 Clip the polygon P l ,...,P xt in Fig. 5-10 against the rectangular clipping window using the 
Sutherland I lodgman algorithm. 

SOLUTION 

We first clip against the top boundary line, then the left, and finally the bottom. The right boundary is 
omitted since it does not affect any vertex list. The intermediate and final results are in Fig. 5-28. 
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Fig. 5-21 
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Fig. 5-22 


Fig. 5-23 



F'ig. 5-24 Clip against AB. 



L | R 

Fig. 5-25 Clip against BC. 


5.17 Use the Weiler-Atherton algorithm to clip the polygon in Fig. 5-29 (a). 
SOLUTION 

See Fig. 5-29(6) and (c). 


5.18 Consider the example in Sect. 5.5, where the object would appear turning slowly around its center 
even if we set Aa = 0. How to keep the orientation of the object constant while making it rotate 
around the center of the display area? 
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Clip against Clip against Clip against 

top boundary left boundary bottom boundary 

Fig. 5-28 



polygon 

(a) 

SOLUTION 

Aa = — AP, i.e., a = — ft. 



vertex 


0) (C) 

Fig. 5-29 


5.19 How to animate the flag in Fig. 5-30(n) that may be in two different positions using lookup table 
animation? 

SOLUTION 

See Fig. 5-30 (b). The area where position 1 overlaps position 2 is assigned entry 0 that has the color of 
the flag. The rest of position 1 is assigned entry 1 and that of position 2 entry 2. Now we only need to alternate 
entries 1 and 2 between the flag color and the background color. 
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(«) 


(b) 


Fig. 5-30 


Supplementary Problems 


5.20 Find the workstation transfonnation that maps the normalized device screen onto a physical device whose x 
extent is 0 to 1 99 and y extent is 0 to 639 where the origin is located at the (a) lower left corner and (b) upper 
left corner of the device. 


5.21 Show that for a viewing transformation, s x = s y if and only if a w = a c , where a K is the aspect ratio of the 
window and a v the aspect ratio of the viewport. 


5.22 Find the normalization transformation which uses a circle of radius five units and center (1, 1) as a window 
and a circle of radius \ and center , j) as a viewport. 


5.23 Describe how clipping a line against a circular window (or viewport) might proceed. Refer to Fig. 5-31. 



i 


i 


Fig. 5-31 


5.24 Use the Sutherland-FIodgman algorithm to clip the line segment joining /*, (— 1 , 2) to P 2 ((>. 4) against the 
rotated window in Prob. 5.5. 



Three-Dimensional 

Transformations 


Manipulation, viewing, and construction of three-dimensional graphic images requires the use of three- 
dimensional geometric and coordinate transformations. These transformations are formed by composing 
the basic transformations of translation, scaling, and rotation. Each of these transformations can be 
represented as a matrix transformation. This pennits more complex transformations to be built up by use of 
matrix multiplication or concatenation. 

As with two-dimensional transformations, two complementary points of view are adopted: either the 
object is manipulated directly through the use of geometric transformations, or the object remains 
stationary and the viewer’s coordinate system is changed by using coordinate transformations. In addition, 
the construction of complex objects and scenes is facilitated by the use of instance transformations. The 
concepts and transformations introduced here are direct generalizations of those introduced in Chap. 4 for 
two-dimensional transformations. 


6.1 GEOMETRIC TRANSFORMATIONS 

With respect to some three-dimensional coordinate system, an object Obj is considered as a set of 

points: 

Obj = {P(x,y, z)} 

If the object is moved to a new position, we can regard it as a new object Obj', all of whose coordinate 
points P'(\ J ,y,zj can be obtained from the original coordinate points P(x, y, z) of Obj through the 
application of a geometric transformation. 


Translation 

An object is displaced a given distance and direction from its original position. The direction and 
displacement of the translation is prescribed by a vector 

V = al + bJ + cK 


1 14 
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The new coordinates of a translated point can be calculated by using the transformation 


T v : 


x' — x + a 
y —y + b 
/ = z + c 


(see Fig. 6-1). In order to represent this transformation as a matrix transformation, we need to use 
homogeneous coordinates (App. 2). The required homogeneous matrix transformation can then be 
expressed as 


(x J \ 


o 

o 

( X ^ 

y 


0 10 6 

y 

t! 


0 0 1 c 

z 

w 


^0 0 0 \ } 

V i / 



Scaling 

The process of scaling changes the dimensions of an object. The scale factor s determines whether the 
scaling is a magnification, s > 1 , or a reduction, s < 1 . 

Scaling with respect to the origin, where the origin remains fixed, is effected by the transformation 

x r = s x ■ x 
z' =S 2 -Z 


In matrix form this is 


J s r ,s y .s : 



0 0 \ 

*, 0 
0 sj 


Rotation 

Rotation in three dimensions is considerably more complex than rotation in two dimensions. In two 
dimensions, a rotation is prescribed by an angle of rotation 0 and a center of rotation P. Three-dimensional 
rotations require the prescriptioin of an angle of rotation and an axis of rotation. The canonical rotations 
are defined when one of the positive x, y, or z coordinate axes is chosen as the axis of rotation. Then the 
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construction of the rotation transformation proceeds just like that of a rotation in two dimensions about the 
origin (see Fig. 6-2). 



Fig. 6-2 


Rotation about the z Axis 

From Chap. 4 we know that 


^ ri.K : 


x' = x cos 0 — y sin 0 
y = xsin 0 + .VCOS0 


Rotation about the y Axis 


An analogous derivation leads to 


^a.J : 


y = xcos0 + 2 sin 0 

y — y 

2 / — —x sin 0+2 cos 0 


Rotation about the x Axis 


Similarly: 


* 0.1 


x' — x 

y = y cos 0 — 2 sin 0 
2 / =_vsinO + 2 cos 0 


Note that the direction of a positive angle of rotation is chosen in accordance to the right-hand rule with 
respect to the axis of rotation (App. 2). 

The corresponding matrix transformations are 


^e,K — 


^tf.J — 


^s.i — 


/ cos 0 — sin 0 O' 
sin 0 cos 0 0 


V 

0 

0 

1 

/ cosO 

0 

sinO 


0 

1 

0 

V- 

sinO 

0 

cosO 

/* 

0 


0 

b 

cos 0 


- sin 0 

Vo 

sinO 


cos 0 
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The general use of rotation about an axis L can be built up from these canonical rotations using matrix 
multiplication (Prob. 6.3). 

6.2 COORDINATE TRANSFORMATIONS 

We can also achieve the effects of translation, scaling, and rotation by moving the observer who views 
the object and by keeping the object stationary. This type of transformation is called a coordinate 



x 


Fig. 6-3 

transformation. We first attach a coordinate system to the observer and then move the observer and the 
attached coordinate system. Next, we recalculate the coordinates of the observed object with respect to this 
new observer coordinate system. The new coordinate values will be exactly the same as if the observer had 
remained stationary and the object had moved, corresponding to a geometric transformation (see Fig. 6-3). 

If the displacement of the observer coordinate system to a new position is prescribed by a vector 
V — al 4- hi + cK, a point P(x, y, z) in the original coordinate system has coordinates P(x',f. z ) in the 
new coordinate system, and 

x' = x — a 

f v : • / =y- b 

z! = z — c 

The derivation of tins transformation is completely analogous to that of the two-dimensional transforma- 
tion (see Chap. 4). 

Similar derivations hold for coordinate scaling and coordinate rotation transformations. 

As in the two-dimensional case, we summarize the relationships between the matrix forms of the 
coordinate transformations and the geometeric transformations: 

Coordinate Transformations Geometric Transformations 

Translation TV 7\ v 

Rotation R H R _ tl 

Scaling ^s x ,s r .s. ^i/s x .i/s y ,i/s z 

Inverse geometric and coordinate transformations are constructed by performing the reverse operation. 
Thus, for coordinate transformations (and similarly for geometric transformations): 

Ty 1 = T_ v Rfj 1 = R_ e = ^]/.vi/.5 v ,i/j J 

6.3 COMPOSITE TRANSFORMATIONS 

More complex geometric and coordinate transformations are fonned through the process of 
composition of functions. For matrix functions, however, the process of composition is equivalent to 



118 


THREE-DIMENSIONAL TRANSFORMATIONS 


[CHAP. 6 


matrix multiplication or concatenation. In Probs. 6.2, 6.3, 6.5, and 6.13, the following transformations are 
constructed: 

1 . ri V N = aligning a vector V with a vector N. 

2. R r) [ = rotation about an axis L. This axis is prescribed by giving a direction vector V and a point 
P through which the axis passes. 

3. S s s ^ s P - scaling with respect to an arbitrary point P. 

In order to build these more complex transformations through matrix concatenation, we must be able 
to multiply translation matrices with rotation and scaling matrices. This necessitates the use of 
homogeneous coordinates and 4x4 matrices (App. 2). The standard 3x3 matrices of rotation and 
scaling can be represented as 4 x 4 homogeneous matrices by adjoining an extra row and column as 
follows: 


/ a b c O'* 
d e f 0 
g h i 0 
\ 0 0 0 1 / 

These transformations are then applied to points P(x,y,z) having the homogeneous form: 

(x\ 

y 

z 

w 

EXAMPLE 1 . The matrix of rotation about the y axis has the homogeneous 4x4 form: 


( cos ft 

0 

sin 0 

°\ 

0 

1 

0 

0 

—sin 8 

0 

cos 8 

0 

l o 

0 

0 

1 / 


6.4 INSTANCE TRANSFORMATIONS 

If an object is created and described in coordinates with respect to its own object coordinate space, we 
can place an instance or copy of it within a larger scene that is described in an independent coordinate 
space by the use of three-dimensional coordinate transformations. In this case, the transformations are 
referred to as instance transformations. The concepts and construction of three-dimensional instance 
transformations and the composite transformation matrix are completely analogous to the two-dimensional 
cases described in Chap. 4. 


Solved Problems 


6.1 Define tilting as a rotation about the x axis followed by a rotation about the v axis: (a) find the tilting 
matrix; (b) does the order of performing the rotation matter? 
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SOLUTION 

(a) We can find the required transfonnation T by composing (concatenating) two rotation matrices: 

T = ■ Re „ i 


( COS 0y 

0 

sin 0 y 0 ) 


{ 1 

0 

0 

°\ 

0 

1 

0 0 


0 

cos 9 X 

— sin 9 X 

0 

— sin 0 y 

0 

COS Oy 0 


0 

sinfl x 

COS0 X 

0 

y. o 

0 

0 1) 


U 

0 

0 

W 

/ cos 0 V 

sin0 y sm6 x sin 0 y 

cos 0 X 

0> 


» 


cos 9 X 


- sin 0 X 

0 



I —sin 0 y cos 6 y sm 0 X cos 0 y cos 0 t 0 
\ 0 0 0 I / 

(b) We multiply R# j • R s j to obtain the matrix 

/ cosfi,, 0 sin0 v 0\ 

sin 9 X sin 0 t cos 0 X — sin 6 X cos 0 , 0 

— cos 0 X sin 0 V sin 0 X cos 0 X cos 0 

\ o 0 01/ 

This is not the same matrix as in part a; thus the order of rotation matters. 


6.2 Find a transformation Ay which aligns a given vector V with the vector K along the positive z axis. 

SOLUTION 

See Fig. 6-4(«). Let V = al + bJ 4- (K We perform the alignment through the following sequence of 
transformations [Figs. 6-4 (b) and 6-4(c)]: 

1 . Rotate about the x axis bv an angle 9 X so that V rotates into the upper half of the xz plane (as the vector 
V,). 

2. Rotate the vector Vj about the v axis by an angle —0 2 so that V, rotates to the positive z axis (as the 
vector V 2 ). 

Implementing step 1 from Fig. 6-4(b), we observe that the required angle of rotation 0, can be found by 
looking at the projection of V onto the yz plane. (We assume that b and c are not both zero.) From triangle 
OP'B : 


sin 0, = 


s/vT7- 


COS 01 = 


sfvT?- 


The required rotation is 


R&i.i = 


/' 

o 

0 

Vo 


o 


0 


VP+C 2 Vb 2 + C 2 

b c 


s/ b 2 + c 1 s/b 1 + e 1 


0 


0 

0 

1/ 


Applying this rotation to the vector V produces the vector V, with the components (a, 0. s/b 2 + c 2 ). 
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6.3 


Implementing step 2 from Fig. 6-4(c), we see that a rotation of —0 2 degrees is required, and so from 
triangle OQQ'\ 


sin(— 0 2 ) = — sin 0 2 =- 


Then 


<Sa 2 + b 2 +c 2 

I sfFTF- 


and eos( — 0 2 ) = cos 0, = ^ 2 c i — 

2 2 Va 2 +b 2 +c 2 


— 


0 


\ 


\/a 2 + b 2 +c 2 

0 1 


0 


0 


s /a 2 + b 2 4- c 2 

0 0 


Va 2 + b 2 + c 2 
0 0 
sfW T? 0 


s/a 2 +b 2 +c 2 

0 1/ 

Since |V| = s/a 2 + b 2 + c 2 , and introducing the notation A = s/b 1 + c 2 , we find 

(± Zf?£ 

IVI A|V| A|V| 

c—b 

0 A T 0 

— — — Q 

I VI | VI I VI 

Vo o 01 / 

If both b and c are zero, then V = a I, and so A = 0. In this case, only a ±90 rotation about they axis 
is required. So if A = 0, it follows that 

/o o — o\ 


— 


l°l 

0 10 0 

a 


M 


0 0 0 


Vo 0 0 1 / 

In the same manner we calculate the inverse transformation that aligns the vector K with the vector V: 



*M>' 

-1 

/ A 

0 

a 

|V| 

|V| 

—ab 

c 

b 

A|V| 

A 

|V| 

—ae 

b 

C 

A|V| 

_ A 

|V| 

V 0 

0 

0 


T TCTT 0 


—— 0 


Let an axis of rotation L be specified by a direction vector V and a location point P. Find the 
transformation for a rotation of 9° about L. Refer to Fig. 6-5. 



122 


THREE-DIMENSIONAL TRANSFORMATIONS 


[CHAP. 6 




SOLUTION 

We can find the required transformation by the following steps: 

1 . Translate P to the origin. 

2. Align V with the vector K. 

3. Rotate by 6 about K. 

4. Reverse steps 2 and 1. 

So 


Rg L — T-P • dy Rt< k ' y *V • T_p 


Here, A y is the transformation described in Prob. 6.2. 


6.4 The pyramid defined by the coordinates ,4(0, 0, 0), 5(1, 0, 0), C(0, 1, 0), and D( 0, 0, 1) is rotated 
45 about the line L that has the direction V = J + K and passing through point C(0, 1, 0) (Fig. 
6-6). Find the coordinates of the rotated figure. 

SOLUTION 

From Prob. 6.3, the rotation matrix Rg L can be found by concatenating the matrices 

Rq l = F_ p ' -dy * Rg |v ■ ,4y * T—p 


/ 1 00 0 \ 

0 10-1 
0 0 1 0 

\0 0 0 1 / 


With P = (0, 1.0), then 
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Now V = J + K. So from Prob. 6.2, with a = 0, b = 1, c = 1, we find X = 72, |V| = s/2, and 

(\ 0 00^ 

0 


sTi 75 

l i 


0 


72 72 

\o 0 01/ 


/I 0 0 0\ 

I 


Also 


^4 5 ,K 


-L ± 0 0^ 

72 s/2 


1 


I 


0 0 
1 0 

\ 0 001/ 


72 72 
0 0 


r-1 


0 vf Ti 0 

0 - > 0 

72 72 

Vo o 01/ 


(\ 0 0 0\ 
0 10 1 
0 0 10 
\0 0 0 1 / 


Then 




Rf).L — 


I \ 


1 2+ 72 2- 72 2- 72 

2 4 4 4 

1 2- 72 2 + 72 72-2 
2 4 4 4 

V 0 0 0 1 / 

To find the coordinates of the rotated figure, we apply the rotation matrix R, ) L to the matrix of homogeneous 
coordinates of the vertices A,B, C, and D : 


C = (A BCD) = 


/0 1 0 0\ 
0 0 10 
0 0 0 1 
V 1 ' 1/ 


So 


Rgr • C = 


< I !±^ 0 , 

2 2 

2-72 4-72 , 2-72 


4 4 

72-2 72-4 
4 4 

V * 11 


2 

7? 

2 

1 ) 


The rotated coordinates are (Fig. 6-7) 

H 2-72 72-2) 




4 ’ 4 / 

1 + 72 4 - 72 72 -4 




C' = (0, 1.0) 
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Fig. 6-7 


6.5 Find a transformation /1 V N which aligns a vector V with a vector N. 

SOLUTION 

We form the transfonnation in two steps. First, align V with vector K, and second, align vector K with 
vector N. So from Prob. 6,2, 


A\',N — ' 1 Ay 

Referring to Prob. 6. 1 2, we could also get A v N by rotating V towards N about the axis V x N. 

6.6 Find the transformation for mirror reflection with respect to the xy plane. 

SOLUTION 

From Fig. 6-8, it is easy to see that the reflection of P(x,v, z) is P(x.y, —z). The transformation that 
performs this reflection is 

/ 1 0 0 

M = 0 1 0 

\0 0 -1 

6.7 Find the transformation for mirror reflection with respect to a given plane. Refer to Fig. 6-9. 

SOLUTION 

Let the plane of reflection be specified by a nomial vector N and a reference point P 0 (x 0 , y 0 , z 0 ). To 
reduce the reflection to a mirror reflection with respect to the xy plane: 

1. Translate P () to the origin: 

2. Align the nonnal vector N with the vector K nonnal to the .w plane. 

3. Perform the mirror reflection in the .rv plane (Prob. 6.6). 

4. Reverse steps 1 and 2. 

So, with translation vector V = — x 0 I — y 0 J — z 0 K 

M n ^ = 7V'-V M A N T y 

Here, /1 N is the alignment matrix defined in Prob. 6.2. So if the vector N = //,! + n 2 J 4- n t K. then from Prob. 
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6.8 




Fig. 6-9 


6.2, with |N| = s /n\ + n\ + n\ and A j= + n\, we find 


— 


/ A 

— «1«2 

-«1«3 

°1 


/ 

X 

0 

« l 

0 \ 

INI 

A|N| 

A|N| 



W\ 


|N| 


0 

«3 

“«2 

0 



-«l«2 

'll 

n 2 

0 

X 

X 

and /1 N ' = 


m 

X 

|N| 


II 2 

»3 

0 




—n 2 

"3 

0 

IM 

IN 

|N| 



A|N| 

X 

|N| 

\ o 

0 

0 

1 ) 



0 

0 

0 

1/ 


In addition 



( 1 

0 

0 

-*<>\ 

n 

0 

0 

r 0 > 

Ty = 

0 

0 

1 

0 

0 

1 

0 o 

1 1 

and Ty 1 = I q 

1 

0 

0 

1 

Vo 

-0 



0 

0 

1 ) 

\o 

0 

0 

1 / 


Finally, from Prob. 6.6, the homogeneous form of M is 


M = 


/I 0 0 0\ 

0 1 0 0 
0 0-10 
\0 0 01 / 


Find the matrix for mirror reflection with respect to the plane passing through the origin and having 
a normal vector whose direction is N = I + J + K. 



126 


THREE-DIMENSIONAL TRANSFORMATIONS 


[CHAP. 6 


SOLUTION 

From Prob. 6.7, with P o (0. 0, 0) and N = I + J + K, we find |N| = 73 and A = 72. Then 


(\ 0 

0 0\ 




/l 

0 

0 

°\ 

0 1 

0 0 




0 

1 

0 

0 



(V = 01 + 0J + OK) 

Ty' 

= 





0 0 

1 0 




0 

0 

1 

0 

Vo 0 

o iy 




^0 

0 

0 

1/ 

/V2 

-i 

7573 


( ^ 

0 

1 

0^ 


73 

7273 


73 

73 


0 

1 

72 


-1 

1 


1 

0 


7! 

A - 1 _ 

7273 

72 

7 ! 


1 

1 

‘ 0 

73 


-1 

— 

1 

-1 

0 


7 ! 

7 ! 


7273 

72 

7 ? 


V o 

0 

o iy 


V 0 

0 

0 

l) 



and 


The reflection matrix is 


(\ 0 

0 1 

0 0 

\0 0 


0 0\ 
0 0 
-1 0 
0 1 / 


A^N.O 


= 7V 1 -V MA S 



_ 2 
3 

2 

3 

( 0 


2 _2 

3 3 

1 2 

3 3 

2 1 

3 3 

0 0 


■7V 

0 

0 

ly 


Supplementary Problems 


6.9 Align the vector V = I + J + K with the vector K. 

6.10 Find a transformation which aligns the vector V = I + J + K with the vector N = 21 — J — K. 

6.11 Show that the alignment transformation satisfies the relation /!“' = . 

6.12 Show that the alignment transformation /4 V N is equivalent to a rotation of 6 about an axis having the 
direction of the vector VxN and passing through the origin (see Fig. 6-10). Here 0 is the angle between 
vectors V and N. 
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Fig. 6-10 


6.13 How can scaling with respect to a point P n (x 0 ,v 0 , z 0 ) be defined in terms of scaling with respect to the origin? 





Mathematics of 
Projection 


Needless to say, there are fundamental differences between the true three-dimensional world and its 
pictorial description. For centuries, artists, engineers, designers, drafters, and architects have tried to come 
to terms with the difficulties and constraints imposed by the problem of representing a three-dimensional 
object or scene in a two-dimensional medium — the problem of projection. The implemented of a computer 
graphics system face the same challenge. 

Projection can be defined as a mapping of point P (x, y, z ) onto its image P' (x\y, z') in the projection 
plane or view plane, which constitutes the display surface (see Fig. 7-1). The mapping is determined by a 
projection line called the projector that passes through P and intersects the view plane. The intersection 

point is P' . 



Fig. 7-1 The problem of projection. 


The result of projecting an object is dependent on the spatial relationship among the projectors that 
project the points on the object, and the spatial relationship between the projectors and the view plane (see 
Sec. 7. 1 ). An important observation is that projection preserves fines. That is, the fine joining the projected 
images of the endpoints of the original line is the same as the projection of that fine. 

The two basic methods of projection perspective and parallel are designed to solve the basic but 
mutually exclusive problems of pictorial representation: showing an object as it appears and preserving its 
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true size and shape. We characterize each method and introduce the mathematical description of the 
projection process in Sec. 7.2 and 7.3, respectively. 


7.1 TAXONOMY OF PROJECTION 

We can construct different projections according to the view that is desired. 

Figure 7-2 provides a taxonomy of the families of perspective and parallel projections. Some 
projections have names — cavalier, cabinet, isometric, and so on. Other projections qualify the main 
type of projection — one principal vanishing-point perspective, and so forth. 

Projections 


Perspective Parallel 

(converging projectors) (parallel projectors) 





Orthographic 

Oblique 

One point 

Two point 

Three point 

(projectors perpendicular 

(projectors not perpendicular 

(one 

(two 

(three 

to view plane) 


to view plane) 

principal 

principal 

principal 




vanishing 

vanishing 

vanishing 




point) 

points) 

points) 

Multi view 

Axonometric 

General 




(view plane 

(view plane 





parallel to 

not parallel to 

/ \ 




principal 

principal 

Cavalier Cabinet 




planes) 

planes) 



Isometric Dimetric Trimetric 
Fig. 7-2 Taxonomy of projection. 


7.2 PERSPECTIVE PROJECTION 
Basic Principles 

The techniques of perspective projection are generalizations of the principles used by artists in 
preparing perspective drawings of three-dimensional objects and scenes. The eye of the artist is placed at 
the center of projection , and the canvas, or more precisely the plane containing the canvas, becomes the 
view plane. An image point is determined by a projector that goes from an object point to the center of 
projection (see Fig. 7-3). 

Perspective drawings are characterized by perspective foreshortening and vanishing points. Perspective 
foreshortening is the illusion that objects and lengths appear smaller as their distance from the center of 
projection increases. The illusion that certain sets of parallel lines appear to meet at a point is another 
feature of perspective drawings. These points are called vanishing points. Principal vanishing points are 
formed by the apparent intersection of lines parallel to one of the three principal x,y, or z axes. The number 
of principal vanishing points is determined by the number of principal axes intersected by the view plane 
(Prop. 7.7). 


Mathematical Description of a Perspective Projection 

A perspective transformation is determined by prescribing a center of projection and a view plane. The 
view plane is determined by its view reference point R 0 and view plane normal N. The object point P is 
located in world coordinates at (x,y,z). The problem is to determine the image point coordinates 
Fix',)/, z!) (see Fig. 7-3). 
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EXAMPLE 1 . The standard perspective projection is shown in Fig. 7-4. Here, the view plane is the xy plane, and 
the center of projection is taken as the point C(0, 0, —d) on the negative z axis. 

Using similar triangles ABC and A'OC, we find 

c = C = C = 0 

z + d ■ z + d 

The perspective transformation between object and image point is nonlinear and so cannot be represented as a 
3x3 matrix transformation. However, if we use homogeneous coordinates, the perspective transformation can be 
represented as a 4 x 4 matrix: 


/x'\ 

y 


{ d -x\ 
dy 


Id 0 0 0 \ 
0 d 0 0 

< x\ 

y 

w 


0 

\2+d) 


0 0 0 0 
^0 0 1 d) 

y 


The general form of a perspective transformation is developed in Prob. 7.5. 


Perspective Anomalies 

The process of constructing a perspective view introduces certain anomalies which enhance realism in 
terms of depth cues but also distort actual sizes and shapes. 

1. Perspective foreshortening. The farther an object is from the center of projection, the smaller it 
appears (i.e. its projected size becomes smaller). Refer to Fig. 7-5. 
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View plane 



sphere A, yet both spheres appear to be the 
same size when projected onto the view plane 


Fig. 7-5 


2. Vanishing points. Pro jections of lines that are not parallel to the view plane (i.e. lines that are not 
perpendicular to the view plane normal) appear to meet at some point on the view plane. A 
common manifestation of this anomaly is the illusion that railroad tracks meet at a point on the 
horizon. 

EXAMPLE 2. For the standard perspective projection, the projections Zj and Zj of parallel lines Z, and Z 2 having 
the direction of the vector K appear to meet at the origin (Prob. 7.8). Refer to Fig. 7-6. 



3. View confusion. Objects behind the center of projection are projected upside down and backward 
onto tire view plane. Refer to Fig. 7-7. 

4. Topological distortion. Consider the plane that passes through the center of projection and is 
parallel to the view plane. The points of this plane are projected to infinity by the perspective 
transformation. In particular, a finite line segment joining a point which lies in front of the viewer 
to a point in back of the viewer is actually projected to a broken line of infinite extent (Prob. 7.2) 
(see Fig. 7-8). 
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Fig. 7-8 


7.3 PARALLEL PROJECTION 
Basic Principles 

Parallel projection methods are used by drafters and engineers to create working drawings of an object 
which preserves its scale and shape. The complete representation of these details often requires two or 
more views (projections) of the object onto different view planes. 

In parallel projection, image points are found as the intersection of the view plane with a projector 
drawn from the object point and having a fixed direction (see Fig. 7-9). The direction of projection is the 
prescribed direction for all projectors. Orthographic projections are characterized by the fact that the 
direction of projection is perpendicular to the view plane. When the direction of projection is parallel to 
any of the principal axes, this produces the front, top, and side views of mechanical drawings (also referred 
to as multiview drawings). Axonometric projections are orthographic projections in which the direction of 
projection is not parallel to any of the three principal axes. Nonorthograhic parallel projections are called 
oblique parallel projections. Further subcategories of these main types of parallel projection are described 
in the problems. (See also Fig. 7-10.) 


Mathematical Description of a Parallel Projection 

A parallel projective transformation is determined by prescribing a direction of projection vector V 
and a view plane. The view plane is specified by its view plane reference point R 0 , and view plane normal 
N. The object point P is located at (x,y, z) in world coordinates. Tire problem is to determine the image 
point coordinates Pj.x’.f, /). See Fig. 7-9. 

If the projection vector V has the direction of the view plane normal N, the projection is said to be 
orthographic. Otherwise it is called oblique (see Fig. 7-10). 

Some common subcategories of orthographic projections are: 

1 . Isometric — the direction of projection makes equal angles with all of the three principal axes 
(Prob. 7.14). 

2. Dimetric — the direction of projection makes equal angles with exactly two of the principal axes 
(Prob. 7.15). 

3. Trimetric the direction of projection makes unequal angles with the three principal axes. 
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Fig. 7-9 Fig. 7-10 


Some common subcategories of oblique projections are: 

1. Cavalier — the direction of projection is chosen so that there is no foreshortening of lines 
perpendicular to the an-' plane (Prob. 7. 1 3). 

2. Cabinet the direction of projection is chosen so that lines perpendicular to the xy planes are 
foreshortened by half their lengths (Prob. 7.13). 

EXAMPLE 3. For orthographic projection onto the xv plane, from Fig. 7-1 1 it is easy to see that 

f x'=x 
Far K : { V — y 


The matrix form o f Par K is 


/I 

0 

0 

0\ 

0 

1 

0 

0 

0 

0 

0 

0 


0 

0 

V 


The general parallel projective transfonnation is derived in Prob. 7.1 1 . 


Solved Problems 


7.1 The unit cube (Fig. 7-12) is projected onto the xy plane. Note the position of the x, y, and z axes. 
Draw the projected image using the standard perspective transfonnation with (a) d — 1 and (b) 
d — 10, where d is distance from the view plane. 
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SOLUTION 

We represent the unit eube in tent 


V = (ABCD. 



k 


z 
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From Example 1 the standard perspective matrix is 


Per K 


Id 0 0 0\ 
0 d 0 0 I 
0 0 0 0 
K 0 0 1 d) 


(a) With d — 1 , the projected coordinates are found by applying the matrix Per K to the matrix of coordinates 
V. Then 


Per K ■ V = 


/O I 10 0 0 1 1 \ 

0 0 1110 0 1 
00000000 
^1 1 1 1 2 2 2 2 } 


If these homogeneous coordinates are changed to three-dimensional coordinates, the projected 
image has coordinates: 


/I' = (0.0,0) £' = (0.±,0) 

£' = (1.0,0) £' = (0.0,0) 

C =(1,1,0) G' = (1,0,0) 

a = (o,i.o) h' = (|,|,o) 


We draw the projected image by preserving the edge connections of the original object (see Fig. 7- 
13). [Note the vanishing point at (0, 0, 0).] 

(b) With d = 10, the perspective matrix is 


Then 


Per K \ = 


( 10 

0 

0 

°\ 

0 

10 

0 

0 

0 

0 

0 

0 

l 0 

0 

1 

10 / 


( 0 

10 

10 

0 

0 

0 

10 

10\ 

0 

0 

10 

10 

10 

0 

0 

10 

0 

0 

0 

0 

0 

0 

0 

0 

V 10 

10 

10 

10 

11 

11 

11 

11/ 


is the matrix image coordinates in homogeneous form. The projected image coordinates are then 

/I' = (0.0,0) £' = (0,lf.0) 

8 = (1,0,0) £' = (0.0.0) 

C' = (1,1.0) Cr = (jy,0, 0) 

D' = (0,1,0) /£ = ({?. lf,0) 


Note the different perspectives of the face E'F'G'H' in Figs. 7-13 and 7-14. [To a viewer standing at the 
center of projection (0, 0, —d), this face is the back face of the unit cube.] 


7.2 Under the standard perspective transformation Per K , what is the projected image of (a) a point in 
the plane z = -d and (b) the line segment joining £,(- 1 , 1 , -2d) to P 2 (2, —2, 0)? (See Fig. 7-15.) 

SOLUTION 

(a) The plane z = —d is the plane parallel to the xy view plane and located at the center of projection 
C(0. 0, — d). If P(x,y, —d) is any point in this plane, the line of projection CP does not intersect the xy 
view plane. We then say that P is projected out to infinity (oo). 

(h) The line P\P 2 passes through the plane z = —d. Writing the equation of the line (App. 2), we have 

a- =-1+3/ _y = 1 — 3f z=-2d + 2dt 




oo 


Pig. 7-15 
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We see that at / = | : a- = |, y = — and z = —d. These are the coordinates of the intersection point I. 
We now describe the perspective projection of this line segment. 

Applying the standard projection to the equation of the line, we find 


Id 0 0 0\ 
0 d 0 0 
0 0 0 0 
^0 0 I dj 


( “ 1+3r \ 

1 - 3/ 

—2d + 2d I 

V i 


I -d + 3 dt \ 
d — 3dt 
0 

v —d + 2d! ) 


Changing from homogeneous to three-dimensional coordinates, the equations of the projected line 
segment are 

-d + 3dt -1+3/ d - 3dt 1-3/ 

X ~ -d + 2 dt ~ - 1 + 2 / - V " -d + 2 dt ~ - 1 + 2 / 2 - 

(In App. 1, Prob. A 1.1 2, it is shown that this is the equation of a line.) When / = 0, then x = 1 and 
y = — 1 . These are the coordinates of the projection P\ of point P, . When / = 1 , it follows that x = 2 and 
y = —2 (the coordinates of the projection P' 2 of point P 2 ). However, when / = the denominator is 0. 
Thus this line segment “passes” through the point at infinity in joining P\ ( 1 , — 1 ) to Pj(2, —2). In other 
words, when a line segment joining endpoints P, and P 2 passes through the plane containing the center 
of projection and which is parallel to the view plane, the projection of this line segment is not the simple 
line segment joining the projected endpoints P\ and PC (See also Prob. A 1.1 3 in App. 1.) 


7.3 Using the origin as the center of projection, derive the perspective transformation onto the plane 
passing through the point P 0 (.v 0 ,_V(j, z 0 ) and having the normal vector N = n x I + n 2 J + rt^K. 

SOLUTION 

Let P(x,y, z) be projected onto P'(x', v'. z ). From Fig. 7-16, the vectors PO and P O have the same 
direction. Thus there is a number ot so that P O = aPO. Comparing components, we have 

x = xx y = ay z = az 



Fig. 7-16 
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We now find the value of a. Since any point P'(x',y', z ) lying on the plane satisfies the equation (App. 2) 

n x x! + n 2 y + nyZ = d a 

(where d 0 = n | x 0 + /ljV'o + n x z 0 ), substitution of x = ax, yf = ay, and z = az into this equation gives 

« = 

n x x + tijy + ti 3 z 

This projection transformation cannot be represented as a 3 x 3 matrix transformation. However, by 
using the homogeneous coordinate representation for three-dimensional points, we can write the projection 
transformation as a 4 x 4 matrix: 


(d 0 

0 

0 

0N \ 

0 

do 

0 

0 

0 

0 

do 

0 

\”i 

n 2 

«3 

<V 


Application of this matrix to the homogeneous representation P(x,y,z , 1) of points P gives P’(d a x, </ a v, 
d 0 z, n x x + n-jy 4- n 3 z), which is the homogeneous representation of P\x, y\ z') found above. 


7.4 Find the perspective projection onto the view plane z = d where the center of projection is the 
origin (0, 0, 0). 

SOLUTION 


The plane z = d is parallel to the xy plane (and d units away from it). Thus the view plane normal vector 
N is the same as the nonnal vector K to the xy plane, that is, N = K. Choosing the view reference point as 
R u ( 0, 0, d), then from Prob. 7.3, we identify the parameters 

N(fl, , n 2 , n 3 ) = (0, 0, 1) R 0 (x 0 ,y 0 ,z 0 ) - (O.O.d) 

So 


and then the projection matrix is 


d 0 = n i* 0 + « z v„ + n 3 z 0 = d 


(d 

0 

0 

°\ 

0 

d 

0 

0 

0 

0 

d 

0 


0 

1 

0/ 


7.5 Derive the general perspective transformation onto a plane with reference point R 0 (x 0 , y Q , z 0 ), 
nonnal vector N = n x \ + ;? 2 J + n^K, and using C(a, b, c) as the center of projection. Refer to Fig. 
7-17. 

SOLUTION 

As in Prob. 7.3, we can conclude that the vectors PC and P C satisfy (see Fig. 7-17) P C = zPC Then 
x' = a(x — a) + a V = a(y — b) + b z = x(z — c) + c 
Also, we find (by using the equation of the view plane) that 

d 

«! (x - a) + n 2 (y - b) + n 3 (z - c ) 

[i.e. P‘(x',y f .z f ) is on the view plane and thus satisfies the view plane equation w,(x' — x 0 ) + 
«2 W -v 0 ) + '>)(- ~ -o) = °J- Here ' d = («i*o + «2 To + n 3 z 0 ) - (n,n + n 2 b + n 3 c). 

From App. 2, Prob. A2.13, d is proportional to the distance D from the view plane to the center of 
projection, that is, d = ±|N|D. 
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Fig. 7-17 

To find the homogeneous coordinate matrix representation, it is easiest to proceed as follows: 

1 . Translate so that the center of projection C lies at the origin. Now R' () = (x 0 — a , v 0 — b, z a — c) 
becomes the reference point of the translated plane (the normal vector is unchanged by translation). 

2. Project onto the translated plane using the origin as the center of projection by constructing the 

transformation (Prob. 7.3). 

3. Translate back. 

Introducing the intermediate quantities 

d 0 = n,x 0 + j?o + n 3 c 0 and d\ = n x a + n 2 b + fijc 

we obtain d = d n — d x , and so Pcr N fl(| C = T c ■ Per N K ■ T_ c . Then with R' () used as the reference point in 
constructing the projection / > Nvg < , 



7.6 Find the (a) vanishing points for a given perspective transformation in the direction given by a 
vector U and (b) principal vanishing points. 

SOLUTION 

(a) The family of (parallel) lines having the direction of U = a, 1 4- it-iJ + can be written in parametric 
form as 


X = U\t +p 


y — u^t 4- tj 


: = u 3 r + r 
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where P(p, q, r ) is any point (see App. 2). Application of the perspective transformation (Prob. 7.5) to 
the homogeneous point (x, y, z, 1 ) produces the result (x', V ,z' , h ), where 

x! = (d + an x )(u x t + p) +an 2 (u 2 t + q) + an 3 (u 3 t 4r) — ad u 
V = bn ] (u ] t 4p) + (d 4 bn 2 )(u 2 t 4 q) + bn 3 (u } t + r) - bd 0 
z! = at i (u i t+p) + cn 2 (u 2 t + q) + (d+ c%)(M 3 i 4 r) - cd 0 
h = «, («, t+p) + n 2 (u 2 t ~\~q) + n 3 (u 3 t + r ) — d x 


The vanishing point corresponds to the infinite point obtained when t = o o. So after dividing r',/, and 
z by h, we let l — ► oo to find the coordinates of the vanishing point: 


(d + o/7, )W| + an 2 u 2 4- 0 /I 3 M 3 
k 


= a + 


du 1 


(Here, k = N • If = n x u x 4- n 2 a 2 + n 3 u 3 .) 


v„ = 


/>«!«! 4 - {d + bn 2 )u 2 4- bn : ,u 3 

-b+^± 

k 

- h+ k 

4- cn 2 u 2 4 (d + cn 3 )it 3 

dit\ 

k 

- c+ k 


This point lies on the line passing through the center of projection and parallel to the vector U (see Fig. 
7-18). Note that k — 0 only when U is parallel to the projection plane, in which case there is no vanishing 
point. 



Fig. 7-18 


(b) The principal vanishing points P\, P 2 , and P 3 correspond to the vector directions 1, J, and K. In these 
cases 


Pv 

X, 

y\ 

d 

= a 4 — 
n 1 
= b 

P 2 :■ 

x 2 = a 

L d 

V? = b H — 
w 2 

Py 


-\ 

= c 


-2 — C 



x 3 = a 
y 3 = b 

d 

Zy = C H 


(Recall from Prob. 7.5 that a, b, c are the coordinates of the center of projection. Also, /?, , n 2 , n 3 are the 
components of the view plane normal vector and d is proportional to the distance D from the view plane 
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to the center of projection.) (Note: If any of the components of the normal vector are zero, say, ti\ = 0, 
then A- = N I = 0, and there is no principal vanishing point in the I direction.) 


7.7 Describe the (a) one-principal-vanishing-point perspective, (b) two-principal -vanishing-point 
perspective, and (c) tliree-principal-vanishing-point perspective. 

SOLUTION 

(a) The one-principal-vanishing-point perspective occurs when the projection plane is perpendicular to one 
of the principal axes (x, y, or z). Assume that it is the z axis. In this case the view plane normal vector N 
is the vector K, and from prob. 7.6, the principal vanishing point is 


x 3 =a 



(b) The two-principal-vanishing-point projection occurs when the projection plane intersects exactly two of 
the principal axes. Refer to Fig. 7-19, which is a perspective drawing with two principal vanishing 
points. In the case where the projection plane intersects the x and y axes, for example, the normal vector 
satisfies the relatioinship N ■ K = 0 or n } = 0, and so the principal vanishing points are 


p v 


x. = a -| 

X\ =b 
Z\ = c 


Pv 


x 2 = a 

. d 
v 2 = b - 1 



Fig. 7-20 
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(c) The three-vanishing-point perspective projection occurs when the projection plane intersects all three of 
the principal axes — x, y, and z axes. Refer to Fig. 7-20, which is a perspective drawing with three 
principal vanishing points. In this case, the principal vanishing points are points P\ . P 2 . and P 3 from 
Prob. 7.6(b). 


7.8 VVliat are the principal vanishing points for the standard perspective transforamtion? 

SOLUTION 

In this case, the view plane normal N is the vector K. From Prob. 1.6(b), since N • I = 0 and N • J = 0, 
there are no vanishing points in the directions I and J. On the other hand, N K = K K = I. Thus there is 
only one principal vanishing point, and it is in the K direction. From Prob. 7.1(a), the coordinates of the 
principal vanishing point VP in the K direction are 

x = a — 0 y = b = 0 z = —d + y = 0 
So VP = (0, 0, 0) is the principal vanishing point. 

7.9 An artist constructs a two-vanishing-point perspective by locating the vanishing points VP l and VP 2 
on a given horizon line in the view plane. The horizon line is located by its height h above the 
ground (Fig. 7-21). Construct the corresponding perspective projection transformation for the cube 
shown in Fig. 7-21. 



SOLUTION 

A two-principal-vanishing-point perspective must intersect two axes, say, x and v. We locate the view 
plane at the point R { ,( 1 , 1, 0) so that it makes angles of 30° and 60° with the corresponding faces of the cube 
(see Fig. 7-21). In this plane w'e locate the horizon line a given height h above the “ground” (the xy plane). 
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The vanishing points VP t and VP 2 are located on this horizon line. To construct the perspective 
transformation, we need to find the normal vector N = //)! + n 2 J + h 3 K of the view plane, the coordinates 
C(a,b,c) of the center of projection, and the view parameters d 0 , i/, , and d (Prob. 7.5). To calculate the 
coordinates of the vanishing points, we first find the equation of the horizon line. Let l x and 1 2 be the points of 
intersection of the view plane and the x and y axes. The horizon line is parallel to the line /,/ 2 and lies h units 
above it. 

From triangles I x BR n and l 2 DR (h we find 

(\ + V3 

V V3 

The equation of the line through /, and I 2 (App. 2) is 

' = “ (^) • ,=(l+ ' /5) ' ! =° 

This line lies in the view plane. So if the equation of the horizon line is then taken to be a line parallel to this 
line and h units above it, the horizon line is guaranteed to be in the view plane. The equation of the horizon 
line is then 


, 1 = ( , + J J ,o,°) 


,0.0) and /, = (0, l + -Z3. 0) 




(1 — t) y = (1 + s/l)t z = h 


The vanishing points VPj and VP 2 are chosen to lie on the horizon line. So VP\ has coordinates of the form 

EP, = ^i±pj(l -,,).(! and VP 2 = - t 2 ), (1 + V3)/ 2 , /,J 

(Here, /, and t 2 are chosen so as to place the vanishing points at the desired locations.) 

To find the normal vector N and the center of projection C, we use the equations in Prob. 7.6, pail (/>) for 
locating the vanishing points of a given perspective transformation. So 




1 + \/T\ 


and 


Using the values 


b = ( 1 4- \/3 )<| and 

'l + vT 


b H = (1 + V3)l 2 


and 


c = h 


a = * = ( , +v / 3)/| 


c = li 


and then substituting, we find 


and 


d ( 1 + \/3V 


(7.1) 


— = ( 1 + s/3)(f2 — t\ ) 
«2 


(7.2) 


Since the plane does not intersect the z axis, then N • K = 0, or using components: » 3 = 0. Finally, we choose 
the normal vector N to be of unit length: 


ini = y«i + n\ + «i = 4- «2 = 


From equations (7.1) and (7.2) 


ds / 3 


(1 + V3)(t 2 — t\ 


n 2 -- 


(1 + n/ 3)(/ 2 — fi) 
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(1 + \/ 3) 2 (^2 — h ) 1 ' 0 + 73 ) V 2 -/,) 2 


Id 1 + 73 

= =1 and so d = — - — (t 2 — t,) 

(1 + 75 Xti-tO 2 


= -nj, _ and _ » = - 

I + y3 2 2 1 + 73 2 


Finally, we have 


73[<1 + 73)/2J _ 73 
1 + 73 2 

/75l+75V 


(l+73)/2 _l 
1 + 73 2 

- 1 + 73 r . 


d\ =«, a + n 2 /> + = f 2 3 1 7f~ )^ - ^ + 5^ 1 + = * + 9 3 [* “(*2 _f l)] 


i/(j = -f- </] = 


1 + 73 


From Prob. 7.5, the perspective transformation matrix is then 


P er N,R„,C ~ 


1 + 73 


l-h 

>-' 2) 0 


73 f, 

t 2 0 

-(1 + 73)1, 

75* 

h 

r~ *2 — 

1 + 73 


1 + 73 

— // 

73 

1 o 

-[1 - (h ~ h)] 

1 + 73 

1+73 


In Chap. 8, Prob. 8.2, it is shown how to convert the transformed image of the cube into x, y coordinates for 
viewing. 

7.10 Derive the equations of parallel projection onto the xy plane in the direction of projection 
V = al + bJ + cK. 

SOLUTION 

From Fig. 7*22 we see that the vectors V and PP' have the same direction. This means that PP' = k\. 
Comparing components, we see that 

x? — x = ka v — v = kb z — z = kc 


r ~ t U , / b 

k = — x = x — z and y = v — ; 
c c c 


In 3 x 3 matrix form, this is 


0 -- 

c 


0 1 — 

c 

0 0 0 


and so P' = Pai\ ■ P. 
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Fig. 7-22 


7.1 1 Derive the general equation of parallel projection onto a given view plane in the direction of a given 
projector V (see Fig. 7-23). 



SOLUTION 

We reduce the problem to parallel projection onto the xv plane in the direction of the projector 
V = u\ 4- bJ + cK by means of these steps: 

1 . Translate the view reference point R 0 of the view plane to the origin using the translation matrix T_ r<i . 

2. Perform an alignment transformation A N so that the view normal vector N of the view plane points in the 
direction K of the nonnal to the xy plane. The direction of projection vector V is transfonned to a new' 
vector V' = V. 

3. Project onto the xy plane using Par v . 
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4. Perform the inverse of steps 2 and 1 . So finally Par y ^ Ro = Tig ■ ■ Par v ■ A N ■ T_ Rg . From what we 

learned in Chap. 6, we know that 

/I 0 0 -x 0 \ 

_ 0 1 0 — >'o 

0 0 1 -zo 

i^O 0 0 1 / 

and further from Chap, 6, Prob. 6.2, w'here A = y/n\ + n\ and X # 0, that 


X 

-«1»2 

-«!« 3 

!N| 

A|N| 

A|N| 

0 

n 3 

-« 2 

X 

X 

«i 

"2 

"3 

|N| 

|N| 

INI 


\ 0 0 0 1 / 


Then, after multiplying, we find 




d x — an x 

—an-, 

— an 3 

ad 0 \ 

—bn t 

d | — bn 2 

-b»3 

K 

—cn i 

—cn 2 

rf) — cn 3 

cd u 

0 

0 

0 

4 ) 


Here = n t x () + n 2 . v n + n i z o and d x = n x a + n 2 b + n 2 c. An alternative and much easier method to 
derive this matrix is by finding the intersection of the projector through P w r ith the equation of the view 
plane (see Prob. A2.14). 


7.12 Find the general form of an oblique projection onto the xv plane. 

SOLUTION 

Refer to Fig. 7-24. Oblique projections (to the xy plane) can be specified by a number/ and an angle 0. 
The number f prescribes the ratio that any line L perpendicular to the xy plane will be foreshortened after 
projection. The angle 0 is the angle that the projection of any line perpendicular to the xv plane makes with the 
(positive) x axis. 



Fig. 7-24 
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To determine the projection transfonnation, we need to find the direction vector V. From Fig. 7-24, with 
line L of length I , we see that the vector P'P has the same direction as V. We choose V to be this vector: 

V = F? = xT+/J-K (=al+bJ + cK) 

From Fig. 7-24 we find a = x! =fcosO, b = / =/sin0, and c = — 1. 

From Prob. 7.10, the required transformation is 


Par v = 


1 0 fcosf) 0\ 

0 1 /sin 0 0 

0 0 0 0 
0 0 0 1 / 
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The image coordinates are then 

A' = (0, 0, 0) 


« = (i,o,o) r-(££.) 

C = (1.1.0) c'=(i 


a = (0.i.o) h' 


Refer to Fig. 7-25. 

To draw the cabinet projection: 


■ V : 


/ sfi> sfi> >fi> 

' 0 1 1 0 —— — — \+^~ 

4 4 4 

0 0 , 1 |i i I 

4 4 4 

0 0 0 0 0 0 0 

\ 1 1 1 1 1 1 1 


The image coordinates are then (see Fig. 7-26) 

A' = (0, 0. 0) 
ff =( 1 , 0 , 0 ) 
C' = (1.1,0) 
D' = (0, 1.0) 


+ 


I 


>/3\ 


4 
0 

I / 


*-(£*•) 
"i' + f'i°) 
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J ► 

i x 


Fig. 7-25 


Fig. 7-26 
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7.14 Construct an isometric projection onto the xy plane. Refer to Fig. 7-27. 



Fig. 7-27 


SOLUTION 


We shall find a “tilting” of the x, v, z axes that transforms the IJK vector tried to a new set I'J'K whose 
orthographic projections onto the xy plane produce vectors of equal lengths. 

Denoting the tilting transformation by T and the orthographic projection onto the xv plane by Par K , the 
final projection can be written as Par = Par K • T , where Par K is as defined in Example 3 and T is as defined 
in Prob. 6.1 in Chap. 6. Multiplying, we find 


Par = 


( cos 6 y 
0 
0 
0 


sin 6 y sin 0 X 
cos 9 X 
0 
0 


sin 0 y cos 6 X 
—s'md x 
0 
0 


0\ 

0 

0 

1/ 


Now 


Par ■ 1 = (COs6 y , 0. 0) Par ■ J = (sin(l v sin 6 X , cos 6 X , 0) Par • K = (sin 0 V cos 6 X , — sin0 T , 0) 

(the projections of the vectors 1, J, and K). To complete the specification of the transformation M, we need to 
find the angles 6 X and 0 V . To do this, we use the requirement that the images Par ■ I, Par • J, and Par • K are to 
all have equal lengths. Now 


and 


| Par ■ I| = ^Jcasr 6 y | Par ■ J| = y4in 2 9 y sin 2 6 t + cos 2 8 X 

| Par ■ K| = y^sin 2 6 y cos 2 0 X + sin 2 8 X 


Setting | Par ■ J| = | Par ■ K| leads to the conclusion that sin 2 6 X — cos 2 0, = 0 and to a solution 6 X = 45° (and 
so sin 8 X = cos 0 X = sfl/2). Setting | Par • I| = | Par ■ J| leads to cos 2 0 y = ^ (sin 2 0 y + 1). Multiplying both 
sides by 2 and adding cos 2 0 r to both sides gives 3 cos 2 B y =2 and a solution is 6 V = 35.26° (and so 
sin () y = yi /3, cos 9 y = s/2/3). Finally 

/ [2 l_ [2 1 12 A 
V 3 2 V 3 2 V 3 


Par = 


0 


0 
0 
V 0 


s/2 

2 

0 

0 


s[2 
' 2 
0 

0 


0 

0 

1/ 
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7.15 Construct a dimetric projection onto the xy plane. 

SOLUTION 


Following the procedures in Prob. 7.14, we shall tilt the .v, v, z axes and then project on the xy plane. We 
then have, as before, 

| Par ■ I| = Jcos 2 6 y \Par ■ J| = yjsin 2 0, sin 2 0 X + cos 2 0 X 
and 


| Par ■ K| = ,/sin 2 6 y cos 2 8 X + sin' 6 X 


To define a dimetric projection, we will specify the proportions 

| Par ■ I| : | Par • J| : \Par • K| = /: 1 : 1 (/ ^ 1) 

Setting | Par ■ J| = | Par ■ K|, we find sin 2 6 X — cos 2 0 X = 0 and 0 X = 45°, so sin 0 t = cos 9 X = s/2/2. Setting 
| Par ■ 1 1 = l\Par ■ J| gives 


7 l 2 7 
cos 2 Oy = — [sin - By 4- 1] 

Multiplying both sides by 2 and adding / 2 cos 2 B y to both sides gives 

(2 + 1 2 ) cos 2 0 y = 2/ 2 


(7.3) 


So 


cos d y = l V 2 + p 


Frotn equation (7.5) we can also find 


2 — l 2 p 

Si " 2 9y = 2 + P and S ' n = V 2+P 


(Note the restriction I < s/2.) Thus 


( , n~ ^ I 2 - ii vs 0 \ 

V2 + P 2 X 2+P 2 V 2 + P 


Par = 


s/2 

0 T- 


-s/2 


0 

V o 


2 0 

0 0 

0 \) 


and 0 < / < s/2. 

Note that any other projection ratio, say, 1:1:7, can be achieved by performing an appropriate rotation 
before applying Par. In this example, a rotaiton of 90° about the y axis aligns the z axis with the x axis so that 
Par can be applied. 


Supplementary Problems 

7.16 Construct a perspective transformation given three principal vanishing points and the distance D from the 
center of projection to the projection plane. 

7.17 Draw the (a) isometric and ( b ) dimetric projections of the unit cube onto the ,yv plane. 

7.18 How many view planes (at the origin) produce isometric projections of an object? 



CHAPTER 8 



Three-Dimensional 
Viewing and Clipping 

An important step in photography is to position and aim the camera at the scene in order to compose a 
picture. This parallels the specification of 3D viewing parameters in computer graphics that prescribe the 
projector (the center of projection for perspective projection or the direction of projection for parallel 
projection) along with the position and orientation of the projection/view plane. 

In addition, a view volume defines the spatial extent that is visible through a rectangular window in the 
view plane. The bounding surfaces of this view volume is used to tailor/clip the objects that have been 
placed in the scene via modeling transformations (Chaps. 4 and 6) prior to viewing. The clipped objects are 
then projected into the window area, resulting in a specific view of the 3D scene that can be further mapped 
to the viewport in the NDCS (Chap. 5). 

In this chapter we are concerned with the specification of 3D viewing parameters, including a viewing 
coordinate system for defining the view plane window, and the formation of the corresponding view 
volume (Sec. 8.1). We also discuss 3D clipping strategies and algorithms (Sec. 8.2). We then summarize 
the three-dimensional viewing process (Sec. 8.3). Finally, we examine the operational organization of a 
typical 3D graphics pipeline (Sec. 8.4). 


8.1 THREE-DIMENSIONAL VIEWING 

Three-dimensional viewing of objects requires the specification of a projection plane (called the view 
plane), a center of projection ( viewpoint ) or the direction of projection, and a view volume in world 

coordinates. 


Specifying the View Plane 

We specify the view plane by prescribing (1) a reference point R 0 (x 0 , y 0 , z 0 ) in world coordinates and 
(2) a unit nonnal vector N = +« 2 J + « 3 K, |N| — 1, to the view plane (see Fig. 8-1). From this 

information, we can construct the projections used in presenting the required view with respect to tire given 
viewpoint or direction of projection (Chap. 7). 
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View Plane Coordinates 

The view’ plane coordinate system or viewing coordinate system can be specified as follows: ( 1 ) let the 
reference point R 0 (x 0 ,y 0 , z 0 ) be the origin of the coordiante system and (2) determine the coordinate axes. 
To do this, we first choose a reference vector U called the up vector. A unit vector J (| can then be 
determined by the projection of the vector U onto the view plane. We let the vector J q define the direction 
of the positive q axis for the view plane coordinate system. To calculate J q , we proceed as follows: with N 
being the view plane unit nonnal vector, let U q = U — (N • U)N (App. 2, Prob. A2.14). Then 


J 


<i 



|U,I 


is the unit vector that defines the direction of the positive q axis (see Fig. 8-2). 



Fig. 8-2 
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Finally, the direction vector I of the positive p axis is chosen so that it is perpendicular to J (( , and, by 
convention, so that the triad I p , J , and N form a left-handed coordinate system. That is: 

_ N x J„ 

P IN x J,| 

This coordinate system is called the view plane coordinate system or viewing coordinate system. A left- 
handed system is traditionally chosen so that, if one thinks of the view plane as the face of a display device, 
then with the p and q coordinate axes superimposed on the display device, the normal vector N will point 
away from an observer facing the display. Thus the direction of increasing distance away from the observer 
is measured along N [see Fig. 8-3 (a)]. 



EXAMPLE 1. If the view plane is the xy plane, then I p = I, J q = J, and the unit normal N = — K form a left- 
handed system. The z coordinate of a point measures the depth or distance of the point from the view plane. The sign 
indicates whether the point is in front or in back of the view plane with respect to the center or direction of projection. 
In this example, we change from right-handed world coordinates (x, y, z) to left-handed view plane coordinates 
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(x\ /, :') [see Fig. 8-3(6)] be performing the transformation: 


xf —X 



In matrix form, for homogeneous coordinates: 

/ 1 0 0 0 \ 

o 1 0 0 

1rl 0 0-10 
^0 0 0 1 f 

The general transformation for changing from world coordinates to view plane coordinates [see Fig. 8- 
3(c)] is developed in Prob. 8.3. 


Specifying the View' Volume 

Tire view' volume bounds a region in world coordinate space that will be clipped and projected onto the 
view plane. To define a view volume that projects onto a specified rectangular window defined in the view' 
plane, we use view plane coordinates (p, q) v to locate points on the view plane. Then a rectangular view 
plane window is defined by prescribing the coordinates of the lower left-hand comer L(p mm , and 

upper right-hand comer R(p max , <7max)i> (see Fig. 8-4). We can use the vectors I p and J q to find the 
equivalent world coordinates of L and R (see Prob. 8.1). 

For a perspective view, the view volume, corresponding to the given window, is a semi-infinite 
pyramid, with apex at the viewpoint (Fig. 8-5). For view's created using parallel projections (Fig. 8-6), the 
view volume is an infinite parallelepiped with sides parallel to the direction of projection. 



Fig. 8-4 
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8.2 CLIPPING 

Clipping against a Finite View Volume 

The view volumes created above are infinite in extent. In practice, we prefer to use a finite volume to 
limit the number of points to be projected. In addition, for perspective views, very distant objects from the 
view plane, when projected, appear as indistinguishable spots, while objects very close to the center of 
projection appear to have disjointed structure. This is another reason for using a finite view volume. 

A finite volume is deliminated by using front (near) and back (far) clipping planes parallel to the view 
plane. These planes are specified by giving the front distance/ and back distance b relative to the view 
plane reference point R 0 and measured along the normal vector N. The signed distance b and / can be 
positive or negative (Figs. 8-7 and 8-8). 


Clipping Strategies 

Two differing strategies have been devised to deal with the extraordinary computational effort required 
for three-dimensional clipping: 

1 . Direct clipping. In this method, as the name suggests, clipping is done directly against the view 
volume. 

2. Canonical clipping. In this method, normalizing transformations are applied which transform the 
original view volume into a so-called canonical view volume. Clipping is then performed against 
the canonical view volume. 
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Fig. 8-8 Parallel view volume. 


The canonical view volume for parallel projection is the unit cube whose faces are defined by the 
planes x = 0, x = 1 , y = 0, v = 1 , z = 0, and z = 1 . Tire corresponding normalization transformation N par 
is constructed in Prob. 8.5 (Fig. 8-9). 



The canonical view volume for perspective projections is the truncated pyramid whose laces are 
defined by the planes x = z, x — — z, y — z,y — — z, z — Zj-, and z - 1 (where z f is to be calculated) (Fig. 8- 
10). Tire corresponding normalization transformation A' per is constructed in Prob. 8.6. 

Tire basis of the canonical clipping strategy is the fact that the computations involved such operations 
as finding the intersections of a line segment with the planes forming the faces of the canonical view 
volume are minimal (Prob. 8.9). This is balanced by the overhead involved in transforming points, many of 
which will be subsequently clipped. 

For perspective views, additional clipping may be required to avoid the perspective anomalies 
produced by projecting objects that are behind the viewpoint (see Chap. 7). 


Clipping Algorithms 

Three-dimensional clipping algorithms are often direct adaptations of their two-dimensional counter- 
parts (Chap. 5). The modifications necessary arise from the fact that we are now clipping against the six 
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A 


v 



Figure 8-10 


faces of the view volume, which are planes, as opposed to the four edges of the two-dimensional window, 
which are lines. 

Tire technical differences involve: 

1. Finding the intersection of a line and a plane (Prob. 8.12). 

2. Assigning region codes to the endpoints of line segments for the Cohen- Sutherland algorithm 
(Prob. 8.8). 

3. Deciding when a point is to the right (also said to be outside) or to the left ( inside ) of a plane for 
the Sutherland-Hodgman algorithm (Prob. 8.7). 

4. Determining the inequalities for points inside the view volume (Prob. 8.10). 

8.3 VIEWING TRANSFORMATION 
Normalized Viewing Coordinates 

We can view' the normalizing transformations N pai and N pet from Sec. 8.2, under “Clipping 
Strategies,” as geometric transformations. That is, Obj is an object defined in the world coordinate 
system, the transformation 

Ob f = N pai ■ Obj or Obj' = N ^ ■ Obj 

yields an object Obj' defined in the normalized viewing coordinate system. 

Canonical clipping is now equivalent to clipping in normalized viewing coordinates. That is, the 
transformed object Obj' is clipped against the canonical view volume. In Chap. 10, where hidden-surface 
algorithms are discussed, it is assumed that the coordinate description of geometric objects refers to 
normalized viewing coordinates. 


Screen Projection Plane 

After clipping in viewing coordinates, we project the resulting structure onto the screen projection 
plane. This is the plane that results from applying the transformations N pitr or ,V per to the given view' plane. 
In the case N pm ., from Prob. 8.5, we find that the screen projection plane is the plane z — —f /(b —f) and 
that the direction of projection is that of the vector K. Thus the parallel projection is orthographic (Chap. 
7), and, since the plane z——f /(b -/) is parallel to the xy plane, we can choose this latter plane as the 
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projection plane. So parallel projection Par in normalized viewing coordinates reduces to orthographic 
projection onto the xy plane. The projection matrix is (Chap. 7, Sec. 7.3) 


/I 

0 

0 

0\ 

0 

1 

0 

0 

0 

0 

0 

0 


0 

0 

1/ 


In the case of perspective projections, the screen projection plane is the plane z = 4(4 + 4) (Prob. 8.6). 
The transformed center of projection is the origin. So perspective projection Per in normalized viewing 
coordinates is accomplished by applying the matrix (Chap. 7, Prob. 7.4) 


/-At o 

d. + b 


Per = 


0 

0 

( 0 


c', + b 

0 

0 


0 

0 

4 

d. + b 

1 


0^ 

0 

0 

0 ) 


Constructing a Three-dimensional View 

The complete three-dimensional viewing process (without hidden surface removal) is described by the 
following steps: 

1 . Transform from world coordinates to normalized viewing coordinates by applying the transfor- 
mations V p . ir or N^. 

2. Clip in normalized viewing coordinates against the canonical clipping volumes. 

3. Project onto the screen projection plane using the projections Par or Per. 

4. Apply the appropriate (two-dimensional) viewing transformations (Chap. 5). 

In terms of transformations, w'e can describe the above process in terms of a viewing transformation 
V T , where 

V T = V 2 ■ Par ■ CL • A par or V T = V 2 - Per ■ CL ■ N per 
Here CL and V 2 refer to the appropriate clipping operations and two-dimensional viewing transformations. 


8.4 EXAMPLE: A 3D GRAPHICS PIPELINE 

The two-dimensional graphics pipeline introduced in Chap. 5 can non be extended to three dimensions 
(Fig. 8-11), where modeling transformation first places individually defined objectes into a common scene 
(i.e. the 3D WCS). Viewing transformation and projection are then carried out according to the viewing 
parameters set by the application. The result of projection in the view plane window' is further mapped to 


Modeling 

Object ► Transformation 
Definition (CTM) 


Viewing 

Transformation 



viewing parameters 


Display 
(frame buffer) 


Fig. 8-11 A 3D graphics pipeline 
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the appropriate workstation viewpoint via 2D viewing transformation and scan-converted to a discrete 
image in the frame buffer for display. 

An application typically specifies the method of projection and the corresponding view volume with 
system calls such as 

perspective (or, <i H „ z b , z b ) 

w'here the viewpoint of perspective projection C is assumed to be at the origin of the WCS and the 
perspective view volume centers on the negative z axis (aw'ay from the viewer); a. denotes the angle 
between the top and bottom clipping planes, a w the aspect ratio of the view plane window, Zy the distance 
from C to the front clipping plane (which is essentially also the view plane), and z b the distance from C to 
the back clipping plane. 

On the other hand, orthographic parallel projection can be specified by 

orthographic (.v lllin , .v max , y^, y nlax , z f , z h ) 

where the direction of projection is along the negative z axis of the WCS; the first four parameters of the 
call define the left, right, bottom, and top clipping planes, respectively; and the role of z, and z b remains the 
same as in the perspective case above. 

Other calls to the system library often provide additional functionality. For example, the center of 
perspective projection can be placed anywhere in the WCS by a call that looks like 

lookat (x c ,y c ,z c ,xp,y p ,z p ) 

where x c ,y c ,z c are the coordinates of C and x p , v p , z p are the coordinates of a reference point P — the 
perspective of view volume now' centers on the line from C to P. The y axis of the WCS, or more precisely, 
vector J, serves as the up vector that determines I p and J q . Ait additional parameter may be included to 
allow rotation of the viewing coordinate system ( with R 0 = C w , the center of the view plane window) about 
its z axis, i.e. line CP. 

Using perspective! ) and lookat( ), we can conveniently produce a sequence of images that animate a 
“walk-by” or “fly-by” experience by placing P on an object and moving C along the path of the camera 
from one frame to the next (Fig. 8-12). 



■y c, 

TC, 

>C, 


Fig. 8-12 

Finally, we w'ant to note a couple of crucial operations of the 3D graphics pipeline that have not yet 
been discussed. The first is to prevent objects and portions of objects that are hidden from the viewer’s 
eyesight from being included in the projected view (Chap. 10). The second is to assign color attributes to 
pixels in a way that makes the objects hi the image look more realistic (Chap. 1 1). 


Solved Problems 


8.1 Let P(p, q), be the view plane coordinates of a point on the view plane. Find the world coordinates 
P(x, y, z) u of tlie point. 
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SOLUTION 

Refer to Fig. 8-13. Let R 0 be the view plane reference point. Let R be the position vector of R u and W the 
position vector of P, both with respect to the world coordinate origin (see Fig. 8-13). Let V be the position 
vector of P with respect to the view plane origin R () . Now 

V = pl p + qj q and W = R + V 



So 


W = R + pip + qi q 

Let the components of I p and J (| be 

Ip = a pl + b p J + J q = a ? I 4- + c (? K 


Also 


R — .r 0 I + v'o J + c () K 

and so from W = R + /d p + qi^ we find 

VV = (x„ + pa p + qa q )\ + (v 0 +pb p + qb q )3 + (z 0 + pc p +qc q ) K 
The required world coordinates of P can be read off from W: 

P(x 0 + pa p + qa q , v 0 + pb p + qb q , z 0 + pc p + qc q ) K 


8.2 Find the projection of the unit cube onto the view plane in Prob. 7.9 in Chap. 7. Find the 
corresponding view plane coordinates of the projected cube. 
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SOLUTION 


Following Prob. 7.9, we must specify several parameters in order to calculate the corresponding 
perspective projection matrix Per^ R C . Choosing h=\,t\ =1, and t 2 = (1 — 75)/(l + s/3), we obtain 


P er N.R„,C 


0 

1 + ^3 

0 

—2 

75 

1-73 

1+75 

0 

-(1 + 73) 

75 

1 

-273 

1 

2(1 + s/3) 

2(1 + 73) 

1 + \/3 

2 

vf 

1 

0 

(\ +373' 


V 1 + ^3 1+75 \l + y/5)j 


Applying Per NR c to the matrix V of homogeneous coordinates of the unit cube, we have Per N » c • V = V', 
where V is the matrix (A ’ B'C 17 E' F G'H’). After matrix multiplication, we have 


V' = 


1 + 75 

— - — x 


_2 

_2 

-273 

— 2\/3 

1 + 73 

1 + 73 

-(1 + 73) 

-1 

-273 

1 + 73 

-3 

1 

-1 

0 

-73 

2 

2(1+73) 

2(1 + 73) 

/I +3s/3\ 

/l+273\ 

-273 

-373 

\ 1 + 75 / 

V l + 73 7 

1+73 

(1 + 73) 


-273 

1 + 73 

—2 

-2 

-275 \ 
1 + 75 

_3 

-(1+73) 

-1 

-275 


1 + 75 

-573 

/l+575\ 

/ 1 +4s/3 \ 

-2 75 

2(1 + 73) 

\2(1 + 73/ 

\2(1 + 75/ 

1 + 75 

-373 

/l+373\ 

(\ +2 75\ 

-275 

(1 + 73) 

V i + 75/ 

V i +75/ 

1 + 75/ 


Changing from homogeneous coordinates to world coordinates (App. 2), we find the coordinates of the 
projected cube to be 


A' 


/2+ 75\ 

1 + 75 ‘ 

E .n i + VJ 5\ 

Vi+37 V 

’2(1+375). 

\3 ’ 75 ’6j 


1 + 75\ 1 + 75 


.VI +2 

C\ 1, 1,0) 

72 1 -E s/3 P 


1+275*2(1 + 


2V3)J L\l+375/ 1 


iy 


n 1 + 73 i\ 
\3 75 6/ 


1 + 73 \ 2(2 4- 75) 1+575 


G' 


+ 3 75/ 1+375 ’ 2(1 + 375). 

l+s/3\ l + v'i 1 +4v/3 


?(±±^L) 

l \i +isH>y 


I + 2757(1 + 


W3_' 

2v/3). 


H\ 1.1, 1) 


To change from world coordinates to view plane coordinates, we first choose an up vector. Choosing the 
vector K, the direction of the positive z axis, as the up vector, we next find the view plane coordinate vectors I p 
and J q . 

With our choices /, and t 2 , we find that the unit normal vector N (Prob. 7.9) is 




Choosing U = K, and using Prob. A2.14 (App. 2), we find that 

U q = U - (N - U)N = U (since N • U = 0) = K and 


|U,| 


= K 
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(Note to student using equation (A2-3) of Prob. A2.14: we have used the fact that |N| = 1 and replaced V p 
with IL and V and U.) 

Now 

_ Nx J q 
p |NxJ q | 


Calculating (App. 2), we obtain 


73 


So 


N x J q = ~ 1 J and |N x J q | = 1 


i 

V “2 2 


To convert a point P with world coordinates (x, v, z) n to view plane coordinates (p, q) 0 , we use the equations 
from Prob. 8.1: 


x = x 0 +pa p + qa 1 V = v n + pb p + qb q z = z 0 + pc p + qc q 
where (x 0 , v 0 , z 0 ) are the coordinates of the view plane reference point R {) . Now 

l„ =a p l + b p i+c p K = ‘ I-^J + OK J q = a q l + b q J + c q K = 0I+0J + IK 
Choosing R 0 ( 1 , 1 , 0) as the view plane reference point, we find 

I -73 

X—-P+ 1 V = — r — p + 1 z — q 

Solving for p and q, we have 

p ~ 2(x — 1) and q = z 


Using these equations, we convert the transformed coordinates to view plane coordinates: 
A' 


/'-V3N 

1+73 ' 

Vl +373/ 

'2(1 +373)J 


B 


I +2^3 '2(1 + 273)J 

C'(0, 0) 




«-M) 


G' 


2(1+373)] 
2 . 1 +473 1 


1 +273 '2(1 +273)J 

H\ 0, 1 ) 


Refer to Fig. 8-1 4. Note also that the coordinates of the view point or center of projection C and the vanishing 



Fig. 8-14 
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points VP\ and VP 2 can be found by using the equations from Prob. 7.9: 


C(a, b.c) = c(l, l + s/3. 0 VP l (o, 1 + s/3, J-j VP 2 ( 2 , 1 - s/3. 


In view plane coordinates: 


VP 


K)„ 


and 


VP 




8.3 Find the transformation T wv that relates world coordinates to view plane coordinates. 

SOLUTION 

The world coordinate axes are determined by the right-handed triad of unit vectors [I. J, K], 

The view' plane coordinate axes are determined by the left-handed triad of vectors [I p , J q , N] and the view- 
reference point R f )(x 0 , v’o.Co). 

Referrring to Fig. 8-3(a), we construct the transfonnation T wv through the concatenation of the matrices 
determined by the following steps: 

1. Translate the view plane reference point /?o(jc 0 ,yo,2o) t0 hie world coordinate origin via the translation 
matrix T v . Here V is the vector with components — x„I — v 0 J — z 0 K. 

2. Align the view plane nonnal N with the vector — K (the direction of the negative z axis) using the 
transformation /1 N _ K (Chap. 6, Prob. 6.5). Let l p be the new position of the vector l p after performing the 
alignment transformation, i.e. 

_k ’ l p 

3. Rotate l p about the z axis so that it aligns with I, the direction of the x axis. With 0 being the angle 
between I p and I, the rotation is R BK (Chap. 6). 

4. Change from the right-handed coordinates to left-handed coordinates by applying the transfonnation Trl 

from Example 1 . Then T m = T rl ■ R (i k • ■ T y . If (x„„y w ,z w ) are the world coordinates of point P, 

the view plane coordinates (*,,, v v ,z 0 ) of P can be found by applying the transfonnation T m ,. 


8.4 Find the equations of the planes forming the view volume for the general parallel projection. 

SOLUTION 

The equation of a plane is determined by two vectors that are contained in the plane and a reference point 
(App. 2, Prob. A2.10). The cross product of the two vectors determines the direction of the nonnal vector to 
the plane. 

In Fig. 8-8, the sides of the window in the view plane have the directions of the view plane coordinate 
vectors I p and J q . With V as the vector detennining the direction of projection, we find the follow'ing planes: 

1 . Top plane — detennined by the vectors I p and V and reference point R r , measured / units along the unit 
nonnal vector N = n, 1 + w 2 J + n 3 K from the upper right comer R{r { , r 2 , r 3 ) of the window. Reference 
point Rj has world coordinates (r t + fn ] , r 2 +.fh 2 , r 3 +fn 3 ). 

2. Bottom plane — determined by the vectors I p and V and the reference point Lf, measured from the lower 
left comer /.(/, , / 2 . / 3 ) of the window. Point Lj has world coordinates (/, +/«, , l 2 4 -fh 2 , / 3 + /« 3 ). 

3. Right side plane — determined by the vectors J q and V and the reference point R f . 

4. Left side plane — determined by the vectors J q and V and the reference point L f . 

Front and back clipping planes are parallel to the view plane, and thus have the same nonnal vector 
N = «j I 4- n 2 J + m 3 K. 

5. Front (near) plane — determined by the nonnal vector N and reference point 

P/(x (l + fn | , y (l + /«,, c ( | -(- fn 3 ), measured from the view reference point i? 0 (x 0 , v 0 ,z 0 ). 
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6. Back (far) plane — determined by the nonnal vector N and reference point 

P h ( x 0 + bn | , v 0 + bn 2 , z 0 -f bn } ), measured b units from the view plane reference point If. 


8.5 Find the normalizing transformation that transforms the parallel view volume to the canonical view 
volume determined by the planes x = 0, x = 1, y = 0, y = 1, z = 0, and z = 1 (the unit cube). 

SOLUTION 

Referring to Fig. 8-8, we see that the required transformation N pm . is built by perfbnning the following 
series of transfonnations: 


1 . Translate so that If, the view plane reference point, is at the origin. The required transformation is the 
translation T_ R . 

2. The vectors I p , J q , and N fonn the left-handed view plane coordinate system. We next align the view 
plane nonnal vector N with the vector — K (the direction of the negative z axis). The alignment 
transfonnation zf N _ K was developed in Chap. 6, Prob. 6.5. Let I p be the new position of the vector l p ; 
that is, l p = /i N _ K • l p . 

3. Align the vector I p with the vector I (the direction of the positive x axis) by rotating l p about the z axis. 
The required transformation is R s K . Here, 6 is the angle between I p and I (Chap. 6). When Rg K aligns I p 
with I, the vector J q (where J q = /1 N _ K • J q ) is aligned with the vector J (the direction of the positive y 
axis). 

4. We change from the right-handed world coordinate system to a left-handed coordinate system. The 
required orientation changing transfonnation is [see Fig. 8-3(6)] (see also Example 1) 

/ 1 0 0 0\ 

0 1 0 0 

1rl ~ 0 0 -1 0 

^00 01; 


5. Let V' be the new position of the direction of projection vector V ; that is, V' — Tm ■ Rft' k ■ A\,-k • V. The 
new position of the transformed view volume is illustrated in Fig. 8-15. Note how the view volume is 
skewed along the line having the direction of the vector V'. Suppose that the components of V' are 
V" = r'.1 4- (/ J + e'K. We now perfonn a shearing transformation that transfonns the newly skewed view 
volume to a rectangular view volume aligned along the z axis. The required shearing transformation is 
determined by preserving the new view volume base vectors I and J and shearing V' to the vector r!,K 
(the K component of V ); that is, I is transformed to I, J is transfonned to J, and V' is transformed to CK. 
The required transfonnation is the matrix 


(t 


0 


SI, = 


0 1 


\0 0 


0 

\f z 

1 > 


In order to concatenate the transfonnation so as to buld A |Wr , we use the 4 x 4 homogeneous fonn of Sh 

( 1 °\ 

Sh 1 0 

J 0 

\0 0 0 1 / 


6. We now translate the new view volume so that its lower left corner L',- will be at the origin. To do this, we 
note that the first four transfonnations correspond to the view plane coordinate system transformation in 
Prob. 8.3. So after performing these transformations, we find that the lower left comer of the view plane 
window L(p min , ^ lnin ) t , (view plane coordinates) transforms to a point L' on the xy plane whose coordinates 
arc (p min , q mm , 0). Similarly, the upper right corner R is transfonned to R (p m;K . q, m \ - 0)- After performing 
the shearing transfonnation Sh, we see that the view volume is aligned with the z axis and the back and 
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front faces are, respectively, b and / units from the xy plane. Thus the lower left corner of the view volume 
is at and the bounds of the view volume are p mm <x< p nm , q mm <v<q mm , 

f < z < b. The required translation is T_ L . 

7. We now scale the rectangular view volume to the unit cube. The base of the present view volume has the 
dimensions of the base of the original volume, which corresponds to the view plane window; that is 


»■ = Pom - Pmm (width) h = </ max - q mn (height) 


The depth of the new view volume is the distance from the front clipping plane to the back clipping 
plane: d = b — /. The required scaling is the matrix (in 4 x 4 homogeneous form) 

0 0 (A 




0-00 

h 


0 0-0 
a 

\ 0 0 0 1 / 


The required transfonnation is then 

Npar = ‘^1 /V, 1 /h,\/J ■ T_r • Sk ■ T R1 ■ P, ) K • v4n,_ k ' ^-R„ 


Note also that after performing the transfonnation N par , the view plane transforms to the plane 
z = parallel to the xy plane. Also, the direction of projection vector V transfonns to a 

vector parallel to the vector K having the direction of the z axis. 


8.6 Find the normalizing transfonnation that transforms the perspective view volume to the canonical 
view volume detennined by the bounding planes x — z, x — —z, y — z, y — —z, z — z f , and z = 1 . 
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SOLUTION 

Referring to Fig. 8-7, we build the normalizing transformation A' pcr through a series of transformations. 
As in Prob. 8.5: 

1 . Translate the center of projection C to the origin using the translation T_ c . 

2. Align the view plane normal N with the vector — K using /1 N _ K . 

3. Rotate I p to the vector I using the rotation /?,, K . (Recall that I p = .-I N _ K ■ I_.) 

4. We now change from right-handed world coordinates to left-handed coordinates by applying the 
transformation 

10 0 0 

0 1 0 0 

0 0-10 
0 0 0 1 

5. The newly transformed view volume is skewed along the centerline joining the origin (the translated 
center of projection) with the center of the (transformed) view plane window (Fig. 8-16). Let C„, be the 
coordinates of the center of the original view r plane window. Then C v has view plane coordinates 

( Pn:m ~f~ Pmax */miii "F nn, \ 

2 ’ 2 /„ 

These are changed to world coordinates as in Prob. 8.1. Let CC 1V be the vector from the center of 
projection to the center of the window'. Let (CC, t ,)' be the transformation of the vector CC„; that is, 
(CC„,)' = ■ R 6 k ■ /1 N _ K • CC W . Then (CC M ,)' is the vector that joins the origin to the center of the 

transfonned view' plane window (Fig. 8-16). Suppose that (CC„.)' = cjl + c' v J 4- clK. We shear the view 
volume so that it transforms to a view volume whose center line lies along the z axis. As in Prob. 8.5, the 




Fig. 8-16 
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required shearing transformation is 



0 

I 

0 

0 



The newly transformed window is, after applying the shearing transformation Sh, located on the z axis at 


6. Referring to Fig. 8-17, the transformed window is now centered on the z axis. The dimensions of the 
window are 


W = Pmax - Pm m (width) and h = </ m!LX - </ min (height) 

The depth of the new view volume is the distance between the front and back clipping planes: d = b — f. 
The transformed window is centered on the z axis at z c = c’ z and is bounded by 


W M' 

< x < — 

2 ~ ~ o 




Fig. 8-17 
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The transformed view plane is located at z c = cL. The transformed front clipping plane is located at 
z f = cl + /. The back clipping plane is now located at z h = cl + b. 

To transform this view volume into the canonical view volume, we first scale in the z direction so that 
the back-clipping plane is transformed to z = 1 . The required scale factor is 


The scaling matrix is 


a, — 


10 0 O' 


0 1 0 0 


.0 0 0 


To find the new window boundaries R" and L", we apply this scaling transformation to the present 
window coordinates 

Then 

„ (w h c' z \ „ ( h- b cL \ 

R ={r2'Ztb) “ d 

Next we scale in the x and y directions so that the window boundaries will be 

^ and &L- < , JC ) 

\ci + b 4 + b d,+bj V 4 + * 4 + b c't. + bj 

That is, the window' boimdaries will lie on the planes jc = z, x = —z, y = z ,and y = —z. The required 
scale factors are 


1 w(ci + b) 

The corresponding scaling transformation is 


and s v = 


h(c' z + b) 


rv(4 + b) 


0 0 0 


h(c'. + b) 


\ 0 001 / 

Multiplication of these scaling transfonnations into one transformation yields 


u'(ri + b) 


0 0 


h(c', + b) 


0 0 


0 


0 


0 
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To find the location of the front clipping plane, z F , we apply the transformation S s ^ s s to the present 
location of the center of the front clipping plane, which is C F ( 0. 0 , c' : + /). So 






That is 


cl +/ 

Z/ ~t + b 


The complete transformation can be written as 


Nper — ■ Sh ■ T m ■ R g k ■ d N _ K ■ T-c 

Note that after perfonning the transformation V |K1 , the view plane is transformed to the plane 

cl 

d,+b 

parallel to the xy plane. Also, the center of projection C is transformed to the origin. 


8.7 How do we determine whether a point P is inside or outside the view volume? 

SOLUTION 

A plane divides space into the two sides. The general equation of a plane is (App. 2) 

«i (* - -fr>) + ";(y - v 0 ) + n 3 (z - z 0 ) - 0 

We define a scalar function, f(P), for any point P(x,v, z) by 

f(P) =f(x,y, z) = ;;,(x - x 0 ) + n 2 (v - ,v 0 ) + n 3 (z - r 0 ) 

We say that a point P is on the same side (with respect to the plane) as point Q if sign f(P) = sign f(Q). 
Referring to Figs. 8-7 or 8-8, let and f F be the functions associated with the top, bottom, right, 
left, near (front), and far (back) planes, respectively (Probs. 8.4 and 8.10). 

Also, L and R are the lower left and upper right comers of the window and P b and P r are the reference 
points of the back and front clipping planes, respectively. 

Then a point P is inside the view volume if all the following hold: 

P is on the same side as L with respect to f T 
P is on the same side as R with respect to f B 
P is on the same side as L with respect to f R 
P is on the same side as R with respect to j] 

P is on the same side as P b with respect to f N 
P is on the same side as P f with respect to f F 

Equivalently 

sign/ r (P) = sign/ r (I) sign f L (P) = sign f L (R) 

sig nf B (P) = sign f B (R) sign/ v (P) = sign f N (P b ) 

sign/ ff (P) = sign. f R (L) sign f F (P) = sign f F (P f ) 

8.8 Show how region codes would be assigned to the endpoints of a line segment for the three- 
dimensional Cohen-Sutherland clipping algorithm for («) the canonical parallel view' volume and 
( b ) the canonical perspective view volume. 

SOLUTION 

The procedure follows the logic of the two-dimensional algorithm in Chap. 5. For three dimensions, the 
planes describing the view volume divide three-dimensional space into six overlapping exterior regions (i.e.. 
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above, below, to right of, to left of, behind, and in front of view volume), plus the interior of the view volume; 
thus 6-bit codes are used. Let P(x, y, z) be the coordinates of an endpoint. 

(a) For the canonical parallel view volume, each bit is set to true (1) or false (0) according to the scheme 

Bit 1 = endpoint is above view volume = sign (y — 1) 

Bit 2 = endpoint is below view volume = sign ( —») 

Bit 3 = endpoint is to the right of view volume = sign (x — 1 ) 

Bit 4 = endpoint is to the left of view volume = sign I —x) 

Bit 5 = endpoint is behind view volume = sign (z — 1) 

Bit 6 = endpoint is in front of view volume = sign (— z) 

Recall that sign (a) = 1 if a is positive, 0 otherwise. 

(b) For the canonical perspective view volume: 

Bit 1 = endpoint is above view volume = sign (v — z) 

Bit 2 = endpoint is below view volume = sign (— z —y) 

Bit 3 = endpoint is to the right of view volume = sign (x — z) 

Bit 4 = endpoint is to the left of view volume = sign (— z — x) 

Bit 5 = endpoint is behind view volume = sign (z — 1) 

Bit 6 = endpoint is in front of view volume = sign {z f - — z) 

The category of a line segment (Chap. 5) is (1) visible if both region codes are 000000, (2) not visible if 
the bitwise logical AND of the region codes is not 000000, and (3) clipping candidate if the bitwise 
logical AND of the region codes is 000000. 


8.9 Find the intersecting points of a line segment with the bounding planes of the canonical view 
volumes for {a) parallel and (b) perspective projections. 

SOLUTION 


Let Pjjx, , t-'i , z t ) and P 2 (x 2 ,.V' 2 " - 2 ) be the endpoints of the line segment. The parametric equations of the 
line segment are 

x=x, +(x 2 -x,)t ,v = V] +(>’ 2 -vi)r z = z, +(z 2 -z l )f 
From Prob. 8.1 1, the intersection parameter is 

-N 

' N P,P 2 


where N is the nonnal vector and R 0 is a reference point on the plane. 

(a) The bounding planes for the parallel canonical view' volume are x = 0, x = 1 , v = 0, y = 1, z = 0, and 
z = 1. For the plane x = 1, we have N = I and R 0 (1, 0, 0). Then 

_ -(x, - I) 

f/ ~ V _ V 

*2 A ! 


If 0 < tj < 1 , the line segment intersects the plane. The point of intersection is then 


x = x, + (x 2 — x, 



*1 ~ * 
x 2 — X, 


) 


1 




+ (-2--l) 


V 


(- 


=y\ +(v 2 — y ] ) ^ 


^2 



The intersections with the other planes are found in the same way. 

(h) The bounding planes for the perspective canonical view volume are x = z, x = — z, y = z, y = — z, 
z = Zr, and z = 1 (where z ( is calculated as in Prob. 8.6). 
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To find the intersection with the plane x = z, for example, we write the equation of the plane as 
x — z = 0. From this equation, we read off the normal vector as N = I — K (App. 2, Prob. A2.9), and the 
reference point is R 0 ( 0, 0, 0). Then 


(*2 _ - v i ) _ (-2 — - i ) 

If 0 < tj < 1 , we substitute t, into the parametric equations of the line segment to calculate the 
intersection point. 

The other intersections are found in the same way. 


8.10 Determine (he inequalities that are needed to extend the Liang Barsky line-clipping algorithm to 
three dimensions for (a) the canonical parallel view volume and (/? ) the canonical perspective view 
volume. 

SOLUTION 

Let P | (x, , y t , Z| ) and P 2 {x 2 ,y 2 , - 2 ) b £ ’ the endpoints of a line. The parametric representation of the line is 

I x — X] + Ax -u 
y =y\ -F Ay - u 
z = z, + A zu 

where 0 < u < 1 , At = x 2 — fy , Ay = >7 — , and A z =z 2 — z t . The infinite extension of the line 
corresponds to u < 0 and 1 < u. 

(a) Points inside the canonical parallel view volume satisfy 

*min < x, + Ax ■ u < x nrax 
Train — >'t *F A y * ^ — Tmax 
“min < 2] + A Z-U <2^ 

where X min TmiD “nun H and X max y.. 1:lx “max 1 • 

Rewrite the six inequalities as 

Pk * u < qk- k = 1 . 2, 3. 4, 5, 6 

where 


Pi = — Av, 

<7l =*1 -i'mn=h 

(left) 

tsJ 

II 

Er 

Vl = *max - *1 = 1 - *1 

(right) 

Pi = -Ay, 

#3 3^1 Amin 3^1 

(bottom) 

Pa = Ay, 

>raax 3 ; 1 1 >1 

(top) 

P 5 = “Az. 

*7 5 -1 -min “1 

(front) 

Pb = Az. 

?6 = -max - Z] = 1 - Zj 

(back) 


(b) Points inside the canonical perspective view volume satisfy (see Fig. 8-10). 

-z < X < z 
-z < y < z 
z f < z < 1 

i.e. 

— z, — A z u < X] + Ax • u < Z\ -F A z u 
— z, — Az ■ u < v t + Ay ■ 11 < Z] -F A zu 
Zy < z, -F Az • « < I 

Rewrite the six inequalities as 


Pk ■ « < </<•• 


k= 1.2,3. 4,5.6 
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where 


p | = — A.v — Ac 

<i\ = x \ + -i 

(left) 

p 2 = Ax — A z. 

</, =2, -X, 

(right) 

p } = — Ay — Ac. 

‘fi = .Vl +Z, 

(bottom) 

Pa = 4v - Az, 

=-i -y\ 

(top) 

Ps = -Az. 

<?5 = -1 ~ z f 

(front) 

p 6 = Az, 

<16 = • - -1 

(back) 


Supplementary Problems 


8.11 Find the equations of the planes forming the view volume for the general perspective projection. 

8.12 Find the intersection point of a plane and a line segment. 



CHAPTER 9 



Geometric 

Representation 


One of the major concepts in computer graphics is modeling of objects. By this we mean numerical 
description of the objects in terms of their geometric property (size, shape) and how they interact with light 
(reflect, transmit). The focus of this chapter is on geometric representation of objects. We will discuss 
illumination and shading models in subsequent chapters. 

A graphics system typically uses a set of primitives or geometric forms that are simple enough to be 
efficiently implemented on the computer but flexible enough to be easily manipulated (assembled, 
deformed) to represent or model a variety of objects. Geometric forms that are often used as primitives 
include, in order of complexity, points, lines, polylines, polygons, and polyhedra. More complex geometric 
forms include curves, curved surface patches, and quadric surfaces. 

9.1 SIMPLE GEOMETRIC FORMS 
Points and Lines 

Points and lines are the basic building blocks of computer graphics. We specify a point by giving its 
coordiantes in three- (or two-) dimensional space. A line or line segment is specified by giving its endpoints 
P,(v,, v'|,z,) and P 2 (x 2 ,y 2 ,z->). 


Polylines 

A polyline is a chain of connected line segments. It is specified by giving the vertices (nodes) 

P 0 P N defining the line segments. The first vertex is called the initial or starting point and the last 

vertex, the final or terminal point (see Fig. 9-1). 


Polygons 

A polygon is a closed polyline, that is, one in which the initial and terminal points coincide. A polygon 

is specified by its vertex list P 0 P N , P 0 . The line segments P u P t . P\P 2 P V P 0 are called the edges 

of the polygon. (In general, we need not specify P 0 twice, especially when passing the polygon to the 
Sutherland Hodgman clipping algorithm.) 
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Fig. 9-1 Fig. 9-2 


A planar polygon is a polygon in which all vertices (and thus the entire polygon) lie on the same plane 
(see Fig. 9-2). 


9.2 WIREFRAME MODELS 

A wirefram model consists of edges, vertices, and polygons. Here vertices are connected by edges, and 
polygons are sequences of vertices or edges. The edges may be curved or straight line segments. In the 
latter case, the wireframe model is called a polygonal net or polygonal mesh (Fig. 9-3). 




(a) Wire frame model. 


( b ) Hidden lines removed. 


Fig. 9-3 
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Representin'; a Polygonal Net Model 

There are several different ways of representing a polygonal net model. 

1. Explicit vertex list V = {P 0 ,P 1 ,P 2 , . . ■ , P N }. The points 2 y ) are the vertices of the 

polygonal net, stored in the order in which they would be encountered by traveling around the 
model. Although this fonn of representation is useful for single polygons, it is quite inefficient for 
a complete polygonal net, in that shared vertices are repeated several times (see Prob. 9.1). In 
addition, when displaying the model by drawing the edges, shared edges are drawn several times. 

2. Polygon listing. In this fonn of representation, each vertex is stored exactly once in a vertex list 

V = (P (] P N ), and each polygon is defined by pointing or indexing into this vertex list (see 

Prob. 9.2). Again, shared edges are draw'n several times in displaying the model. 

3. Explicit edge listing. In this fonn of representation, we keep a vertex list in which each vertex is 
stored exactly once and an edge list in w'hich each edge is stored exactly once. Each edge in the 
edge list points to the two vertices in the vertex list which define that edge. A polygon is now- 
represented as a list of pointers or indices into the edge list. Additional information, such as those 
polygons sharing a given edge, can also be stored in the edge list (see Prob. 9.9). Explicit edge 
listing can be used to represent the more general wireframe model. The wireframe model is 
displayed by drawing all the edges, and each edge is drawn only once. 


Polyhedron 

A polyhedron is a closed polygonal net (i.e., one which encloses a definite volume) in which each 
polygon is planar. The polygons are called the faces of the polyhedron. In modeling, polyhedrons are quite 
often treated as solid (i.e., block) objects, as opposed to wireframes or two-dimensional surfaces. 


Advantages and Disadvantages of Wireframe Models 

Wireframe models are used in engineering applications. They are easy to construct and, if they are 
composed of straight lines, easy to clip and manipulate through the use of geometric and coordinate 
transformations. However, for building realistic models, especially of highly curved objects, w'e must use a 
very large number of polygons to achieve the illusions of roundness and smoothness. 


9.3 CURVED SURFACES 

The use of curved surfaces allows for a higher level of modeling, especially for the constructiodn of 
highly realistic models. There are several approaches to modeling curved surfaces. One is an analog of 
polyhedral models. Instead of using polygons, we model an object by using small, curved surface patches 
placed next to each other. Another approach is to use surfaces that define solid objects, such as polyhedra, 
spheres, cylinders, and cones. A model can then be constructed w'ith these solid objects used as building 
blocks. This process is called solid modeling. 

There are two ways to construct a model — additive modeling and subtractive modeling. Additive 
modeling is the process of building the model by assembling many simpler objects. Subtractive modeling 
is the process of removing pieces from a given object to create a new object, for example, creating a 
(cylindrical) hole in a sphere or a cube. Subtractive modeling is akin to sculpting. 


9.4 CURVE DESIGN 

Given n + 1 data points, P 0 (x ti ,y 0 ) P„(x„,y„) we wish to find a curve that, in some sense, fits the 

shape outlined by these points. If we require the curve to pass through all the points, we are faced with the 
problem of interpolation. If w'e require only that the curve be near these points, we are faced with the 
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problem of approximation. Interpolation arises, for example, in reconstructing the shape of a digitized 
curved objects. Approximation is used in computer graphics to design curves that “look good” or must 
meet some aesthetic goal. To solve these often quite distinct problems, it is necessary to find ways of 
building curves out of smaller pieces, or curve segments, in order to meet the design criteria. (Note that 
curves and curve segments can be modeled as polylines, i.e., drawn with extremely short line segments.) 
When modeling a curve/) . y) by using curve segments, we try to represent the curve as a sum of smaller 
segments ®,-(x) (called basis or blending functions)-. 

fix) - £ a, <&,(*) 

7=0 

We choose these blending functions with an eye toward computation and display. For this reason, 
polynomials are often the blending functions of choice. 

A polynomial of degree n is a function that has the form 

Q(x) - a„x ” + a n _ jx" -1 H h a t x + a 0 

This polynomial is determined by its n -f- 1 coefficients [«„ a 0 ], 

A continuous piecewise polynomial Q(x) of degree n is a set of k polynomials qfx), each of degree n 
and k + 1 knots (nodes) t 0 , ... ,t k so that 

Q(x) = q,(x) for f < x < t i+l and / = 0, . . . , k - 1 

Note that this definition requires the polynomials to match or piece together at the knots, that is, 
qj_\(ti) — cj^tj), i=\,...,k—\. This requirement imposes no restrictions on how smoothly the 
polynomials qfx) fit together. For example, there can be comers or sharp contours at the knots (see 
Fig. 9-4). 


v_A-// 


Fig. 9-4 


Polynomials of high degree are not very useful for curve designing because of their oscillatory nature. 
The most useful piecewise polynomials are those for which the polynomials qfx) are cubic (degree 3). 
There are several reasons for this. One is that the piecewise cubic closely resembles the way a drafter uses a 
mechanical spline. In addition, 3 is the smallest degree which has the required smoothness properties for 
describing pleasing shapes. It is also the minimum number needed to represent three-dimensional curves. 


9.5 POLYNOMIAL BASIS FUNCTIONS 

Let P 0 (.y 0 , v 0 ), .... P„(x n ,y„) represent n + 1 data points. In addition, let t 0 , /, t 2 , . ■ . , be any numbers 
(called knots). The following are common choices for basis or blending functions. 


Lagrange Polynomials of Degree n 


JL x 

am = n- 

. 7=0 X j 





i = 0, 1, . . . , n 
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Note that Lfa) = 1 and L,(x/) = 0 for all j ^ i. (Here n represents term-by-tenn multiplication.) 


Hermite Cubic Polynomials 

Refer to Fig. 9-5. 


Hj(x) = 


H,(x) = 


2(x-t i _ l ) i | 3(x-/,„,) 2 
(ti-ti-rf (/, - C-i ) 2 
2(r i+ i -xf | 3(/ i+1 -x) 2 

(',+ 1 - hf ( f i+ i - hf 

(A- - t,_ I ) 2 (x - t,) 

W-k-if 

(x - r,)(f , + 1 - A-) 3 

('/ + i - tf 


ti<X<t i+ 1 
'i-1 — X < tj 

ti<x< t i+l 


B-Splincs 

Refer to Fig. 9-6. For the knot set t 0 , f,, t 2 , ■ ■ ■ , the nth-degree B-splines B i n are defined recursively: 


- 


^ ^ — X — k + 1 r , / » X tj ... tj+n + 1 x 

„ ,, . B in {X) — - T»/,n-lW +: 7 %l,ii-lW 

0 otherwise h+n k ^i+n + i i 1 1 


for tj < x < t i+n+ j. Note that B i n (x) is nonzero only in the interval [/), f, +n+1 ]. In particular, the cubic B- 
spline 5 i 3 is nonzero over the interval f,- +4 ] (which spans the knots f,, t l+l , t i+2 , t, +4 ). In addition, 

for nonrepeatcd knots, the B-spline is zero at the endknots /, and t i+n+ ] ( from Prob. 9.3), that is, 


B.Jtj) = 0 
Bj, n ( f i+n+\) ~ 0 


(n > 1) 





Fig. 9-5 Hennite cubic basis (unctions. Fig. 9-6 Cubic B-spline B, 3 (x). 
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In using B-splines, we allow for repeated knots, that is, h = t ,+ 1 = • • •• This means that B in can have the 
form 2 in its definition. By letting 2 — 0, we extend the definition of B in to incorporate repeated knots. 

Bernstein Polynomials 

Refer to Fig. 9-7. The Bernstein polynomials of degree n over the interval [0, 1] are defined as 



Fig. 9-7 Cubic Bernstein polynomials. 

The cubic Bernstein polynomials are 

B ( u (,x) = 1 — 3.v + 3.V 2 — x 3 
B u (x) = 3(x - Zx 2 + x 3 ) 

B 2 3 (x) = 3(.r 2 - x 3 ) 

B ?3 (x) = x- 3 

9.6 THE PROBLEM OF INTERPOLATION 

Given data points P 0 (.v 0 , Vo), . . . , P„(x„,y„), we wish to find a curve which passes through these points. 


Lagrange Polynomial Interpolation Solution 

Here 

L(x) = E.v-AW 

i=Q 

where L,(x) are the Lagrange polynomials and L(x) is the nth-degree polynomial interpolating the data 
points. 


Hermitian Cubic Interpolation Solution 

We wish to find a piecewise polynomial H(x) of degree 3 which passes through the data points and is 
also continuously ditferentiable at these points. We also prescribe the values of the derivatives / (or slope 
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of the tangent line) at the given data points, that is, we prescribe the points (jc 0 , Vq), .... (x n ,y'„): 

H(x) = E [ VfHjix) +.v'//,(.v)J 

i-0 

where H,{x) and H,(x) are the Hermitian cubic basis functions and t 0 = x 0 , r, = x u t 2 = x 2 . ■ ■ ■ , t„ = x„ are 
the choices for the knot set. 


Spline Interpolation 

If we require that the interpolating piecewise polynomial be joined as smoothly as possible at the data 
points, the resulting curve is called a spline. Therefore, a spline of degree m has continuous derivatives up 
to order m — 1 at the data points. 

It can be shown that any /nth -degree spline that passes through n + 1 data points can be represented in 
tenns of the B-spline basis functions B in as 

m-\-n - 1 

■?»,(*)= E 

i=0 

In order to define the B-spline functions B im (x) so as to solve the interpolation problem, the knots 
t 0 , /[,... , t m+n+l must be chosen to satisfy the Shoenberg- Whitney condition: 

U <Xi < t i+m+ i, i = 0, ...,n 

The following choices for the knots satisfy this condition (see Prob, 9.4): 


Step 1 . Choose 

%-“••• = 'm < *o 
Step 2. Choose the remaining knots according to 

. */+i H b x i+m 


'/+m+ 1 — ‘ 


111 


{ n + 1 — • • ‘ > X„ 


i = 0, . . . , n — m — 1 


For cubic splines (m = 3), an alternative to step 2, requiring less computation, is step 2'. Choose 


fy-4 ~ x i+ 2’ 


i = 0, . . . , n — 4 


The splines S 2 (x) and S 3 (x) are called quadratic and cubic splines , respectively: 

«+l /i+2 

S 2 (*) = E a i B i. 2 (x) and S,(x) = E a i B i rW 

i=0 i=0 

Confining our attention to the cubic spline, we see that there are n 4-3 coefficients a, to evaluate, requiring 
n + 3 equations. 

The interpolation criterion Sfyv,) = _y / , j — 0, . . . , n provides n + 1 equations: 

n\1 

V/ = S 3 (Xj) - E a i B i,i( x j) 

i-0 

The remaining two equations are usually specified as boundary conditions at the endpoints x 0 and x„. 
Some choices for boundary conditions are 


1 . Natural spline condition 


S$(x n ) = 0 SJ( x„) = 0 


2. 


Clamped spline condition 

b :A x q) — To B s( x n) — }’n 

where y o and are prescribed derivative values. 
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3. Cyclic spline condition 

SM = $(*„) S%(x 0 ) = S''(x n ) 

Tills is useful for producing closed curves. 

4. Anticvclic spline condition 

S'M — —S' 3 (x„) S$(x o) = — (.v„) 

Tliis is useful in producing splines with parallel endings whose tangent vectors are equal in 
magnitude but opposite in direction. 

For technical reasons, boundary condition 1, the so-called natural boundary condition, is the least 
preferred choice. 


9.7 THE PROBLEM OF APPROXIMATION 

The problem is to provide a smooth representation of a three-dimensional curve which approximates 
given data so as to yield a given shape. Usually the data is given interactively in the form of a guiding 
polyline determined by control points P 0 (*o,To> z o)- P\( x i^)’i > z i)- • ■ • . >’„■ z„). We would like to find 

a curve which approximates the shape of this guiding polyline (see Fig. 9-8). 


Pi 



Fig. 9-8 


Bezier-Bernstein Approximation 

Using the Bernstein polynomials, we form the parametric curves: 

MO = £ x t BE in (t) 

i—0 

P(t ): ■ >iO = ity,BE L „(t) 0 < / < 1 

/=o 

MO = £ ZjBE in (t) 

i=0 

where P(t) is called the Bezier curve. 
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Properties of the Bezier-Bernstein Approximation 

There are four basic properties: 

1 . The Bezier curve has the same endpoints as the guiding polyline, that is: 

p 0 = P( o) = MO), mo), 40)] p„ = P( l) = Mi), yi \ ). Mi)] 

2. The direction of the tangent vector at the endpoints P 0 , P n is the same as that of the vector 
determined by the first and last segments P {) P\, P„ \P„ of the guiding polyline. In particular 
p(0) = n ■ (P, - P 0 ) [i.e., x'(0) = n(x, - x 0 ), /(0) = n(y { - v 0 )> z'(O) = «(z, - z 0 )J and P'(l) = 
n(P„-P „_ ,). 

3. The Bezier curve lies entirely within the convex hull of the guiding polyline. In two dimensions, 
the convex hull is the polygon formed by placing a “rubber band” about the collection of points 

P<> Pn- 

4. Bezier curves are suited to interactive design. In fact, Bezier curves can be pieced together so as to 
ensure continuous differentiability at their juncture by letting the edges of the two different 
guiding polylines that are adjacent to the common endpoint be collinear (see Fig. 9-9). 



Bczicr-B-Spline Approximation 

For this approximation, we use B-splines (see Fig. 9-10) 


puy. 


X(t) = E 
(=0 

y(t) = E yA.m(0 

i = 0 

z(t) = ]b z i B i.m(Q 

i=0 


0 < t < n - m + 1 


The wth-degree B-splines i = 0 , are defined for t in the parameter range [0, n — m + 1]. 

The knot set / 0 , ... , t n+m+l is chosen to be the set 0, . . . , 0, 1 , 2, . . . , n — m, n — m + 1 , . . . , n — m + 1 . 

m+i m+1 

This use of repeated knots ensures that the endpoints of the spline coincide with the endpoints of the 
guiding polyline (Prob. 9.6). 

Since the knot spacing is uniform, we can also use an explicit form for calculating the B-splines (Prob. 
9.10). 
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Closed Curves 

To construct a closed B-spline curve which approximates a given closed guiding polygon, we need 
only choose the knots t 0 , . . . , t n+m+ \ to be cyclic, i.e., [0, 1 0, 1 , . . .]. So 

t m+ 2 = t\ = 1 f m+]+i = t i 

hi practice, the quadratic and cubic B-splines B i 2 and B i 2 are the easiest to work with and provide 
enough flexibility for use in a wide range of curve design problems. 


Properties of Bezier-B-Spline Approximation 

There are five basic properties: 

1 . The Bezier B-spline approximation has the same properties as the Bezier Bernstein approxima- 
tion; in fact, they are the same piecewise polynomial if m — n. This includes the properties of 
agreement with the guiding polygon and the tangent vectors at the endpoints and the convex hull 
property. 

2. If the guiding polyline has m + 1 consecutive vertices (control points) which are collinear, the 
resulting span of the Bezier B-spline will be linear. So this approximation allows for linear 
sections to be embedded within the curve. 

3. The Bezier B-spline approximation provides for the local control of curve shape. If a single 
control point is changed, portions of the curve that lie far away are not disturbed. In fact, only 
m + 1 of the spans are affected. This is due to the local nature of the B-spline basis functions. 

4. Bezier B-splines produce a closer fit to the guiding polygon than does the Bezier Bernstein 
approximation. 

5. The Bezier B-spline approximation allows the use of control points P, counted with multiplicities 

of 2 or more. That is, P t — ^i+i P ’ 1+ * for k > 1. This results in an approximation which is 

pulled closer toward this control point. In fact, if the point has multiplicity m + 1, the curve will 
pass through it. 
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9.8 CURVED-SURFACE DESIGN 

The modeling and approximation of curved surfaces is difficult and involves many complex issues. We 
will look at two methods for representing a surface: guiding nets and interpolating surface patches. 


Guiding Nets 

This technique is a direct generalization of the Bezier Bernstein and Bezier B-spline approximation 

methods for curves. A guiding net is a polygonal net with vertices Pfx^.Vjj. z (; ), /' = 0 m. and 

j =■ 0, .... n (see Fig. 9-1 1 ). 



1 . Bezier Bernstein surface. This is the surface with parametric equations: 


Q(s. t)\ 


x( S ,t) = ZZx i 'BE i Js)BEat) 

i =0 7=0 

m n 

y(s, o = E 

i =0 7=0 

m n 

Z(S, t) = E E Z ij BE i.rn (■ v )BE j.„(t) 

1=0 j- 0 


Here 0 < s, / < 1 and BE are the Bernstein polynomials. This approximation has properties 
analogous to the one-dimensional case with respect to the comer points P 00 , P m0 , P 0 „, and P mn . 
The convex hull property is also satisfied. 
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2. Bezier B -spline approximation. In parametric form this is expressed as 


Q(s, t): 


-v(.v, t) = ]T J2 x i iB l Js)Bj p(t) 

i—0j—0 
m n 

.'■(■v, t) = 

i—Oj—O 
m n 

z(x, t) = Y.H z ii B iA s ) B j,p(f) 

i=0j=0 


where 0 < $ < m — a + 1 and 0<s<n — f + 1. The knot sets for s and t used to define the B- 
splines B iy (s) and By p(t) are detennined as in the one-dimensional case. 

Quadratic approximation occurs when a = /? = 2. Cubic approximation occurs when a = — 3. In 

general, quadratic or cubic B-splines are most often used. For both these methods, the construction of the 
guiding net (by locating the control points Py) is left to the user. 


Interpolating Surface Patches 

Instead of using a given set of points Py to construct a given surface, the process of interpolating 
surface patches is based upon prescribing boundary curves for a surface patch and “filling in” the interior 
of the patch by interpolating between the boundary curves. 

1 . Coons surfaces. For this technique, a patch is determined by specifying four bounding curves, 
denoted in parametric vector fonn as P(s , 0), P(s, 1), P((), /), and Pfl, t), 0 < x, t < 1 (see Fig. 
9-12). 



Fig. 9-12 


The (linear) Coons surface patch interpolating the boundary curves can be written in vector 
fonn by using linear interpolation (or blending): 

Q(s , t ) = P(s , 0 )( 1 - t) + P(s, 1 )t + P( 0 , t)(\ -s) + P(\, t)s - P( 0 . 0)( 1 - s)( 1 - t) 

-P( 0 , 1)(1 -s)t- P(l,0)s(l -f)-PQ,l)W 

(The subtractions are required so that the interpolators between comer points are not counted 
twice. ) This idea can be extended to define more general surface patches. 

2. Lofted surfaces. Lofting is used where the surface to be constructed stretches in a given direction; 
an example is the hull of a ship. 
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Given two or more space curves, called cross-section curves , lofting is the process of blending 
the cross sections together using longitudinal blending curves. The simplest example is linear 
blending between cross-section curves Z 5 , (.v) and P 2 {s). The lofted surface is Q(s, t) = 
( 1 — t)P x (s) + tP 2 (s) (see Fig. 9-13). 



Cross section 2, P 2 (f) 


Fig. 9-13 


9.9 TRANSFORMING CURVES AND SURFACES 

All the curve and surface models that we have constructed have the general form 


x = Zx,4> ( - and _y = 

(Add a z component for three dimensions.) 

If M is 2 x 2 transformation matrix 



we can apply M to the functions x and y: 

( a b\f Sa-4> )_( a(Zxyly) + b&yp,) \_( Hax t + by t Yb, \ 

\c dJXLyfo) \c(I^^ l ) + d('Ly l (t> l )) \Z(cx, + dy,)*,) 

The transfonned functions are then 

x — l(ax l + byffQtj y — I(cr, + rfy,)®, 

In other words, to transform these curves and surfaces, it is necessary only to transform the coefficients 
(.v,, v’,). In most cases these coefficients represent data or control points. So the transformation of the 
approximation of a curve or surface is found by first transforming the control points and then forming the 
approximation based on the transformed points. 


9.10 QUADRIC SURFACES 

Spheres, cylinders, and cones are part of the family of surfaces called quadric surfaces. A quadric 
surface is defined by an equation which is of the second degree (in x,y, or z). 

The canonical quadric surfaces are as follows: 
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Sphere 

From Fig. 9-14: 






Ellipsoid 

From Fig. 9-15: 


/ z 2 

a 2 b 2 ^ c 2 


One-sheeted Hyperboloid 

From Fig. 9-16: 


x 2 

a 2 + b 2 c 2 


z 2 x 2 y 2 
c 2 a 2 b 2 


Two-sheeted Hyperboloid 

From Fig. 9-17: 
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Hyperbolic Parboloid 

From Fig. 9-20: 


x 2 y 

a 1 b 2 - “ 



I 

I 



Fig. 9-20 Fig. 9-21 


Elliptic Cone 

From Fig. 9-21: 


When a — b — R. we have a right circular cone. If we restrict z so that z > 0, we have the upper cone. 

Note that, except for spheres and ellipsoids, the mathematical definition produces a figure of infinite 
extent. To use it for computer graphics, we must specify bounds for the surface in the form of bounding 
clipping planes having the form z — h. 


1 7 

a z b 1 


9.11 EXAMPLE: TERRAIN GENERATION 

We show two different ways to generate a Mangle mesh (i.e., a polynomial net consisting of triangles) 
that models the random peaks and valleys of a mountainous landscape. 


Midpoint Displacement 

This is a recursive approach that begins with one or more triangles over the terrain area [see Fig. 
9-22 (a)]. We use tire midpoints of the edges to subdivide each triangle into four smaller ones. The 
midpoints are randomly elevated to produce a rugged appearance [see Fig. 9-22(6)]. The coordinates (x m , 
y,„, z m ) of the midpoint of the edge between P l (.v, , y l , z, ) and P 2 (x 2 ,y 2 ,z 2 ) are calculated as follows: 

’ x m = {x 2 — X\)/2 

■ y m = Oh - Ti )/2 + '■ 

. Zm =U 2 — Z])/2 
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Fig. 9-22 


where r is a random displacement that can be expressed as the product of a random number 0 < A < 1 and 
a function of the projected length (onto the x-z plane) of the edge. For example, if the function is 
F(p) — p/2, then the midpoint can be elevated by no more than half of the projected length of the edge. 
The subdivision process terminates when all edges project to a preset threshold or shorter length. 

Since the original vertices are not moved and each midpoint is displaced only once, the resulting 
terrain tends to maintain the overall shape of the initial model. On the other hand, deep valleys that look 
unrealistic may appear along the original edges. We can alleviate this problem by adding a random 
displacement to the vertices as well as the midpoints at each recursion stage. The tradeoff here is less 
control over the shape of the final terrain structure. 


Brownian Distribution of Disks 

In this approach we superimpose a grid onto the terrain area and initialize a counter for each grid cell 
(see Fig. 9-23 for a rectangular area that can be represented by a two-dimensional counter array). We then 
randomly move a circular disk over the area. At each disk position we increment the counter for every grid 
cell that is covered by the disk. After a while the counter values form an elevation profile of a pile of 
randomly placed disks. Each elevation value describes the height of a point/vertex above the correspond- 
ing grid cell. A triangle-mesh model of the resulting terrain can thus be constructed by first connecting 
each vertex to its four neighbors [analogous to the four neighbouring pixels in Fig. 3-1 8(a)] to form a 
quadrilateral-mesh and then adding diagonal edges to divide each quadrilateral into two triangles. 



Fig. 9-23 
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Fig. 9-24 


For example, the gray-scale image in Fig. 9-24 represents the values in a 5 12 x 512 counter array. The 
values are obtained using a disk of radius 25 after 6000 random movements, each of which is a step of size 
3 forward or backward and simultaneously a step of the same size to the left or to the right. Whenever the 
center of the disk moves beyond the image boundary it is reset to the middle of the image. The counter 
values are normalized so that the maximum count corresponds to white in the image. The resultant 
triangle-mesh is viewed in Fig. 9-25 using perspective projection, looking from a point near the lower-right 
comer of Fig. 9-24 towards the center (with hidden surfaces removed and visible surfaces lit by a light 
source). 



Fig. 9-25 
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Solved Problems 

9.1 Represent a cube by using an explicit vertex list. 

SOLUTION 

Using the notation in Fig. 7-12, one possible representation is 

V = { ABCDAFEDCHEFGHGB } 

Note the vertex repetitions in the list V and the repetition of edge FE as EF. 

9.2 Represent a cube by using polygon listing. 

SOLUTION 

Referring to Fig. 7-12, we form the vertex list 

V = { ABCDEFGH } 

The faces of the cube are polygons (which, in this case, are squares) P ]y ...,P 6 , where 

R, = \ABCD ) R 4 = \ABGF) 

P 2 = { CDEH ) R 5 = \BCHG } 

R, = \ADEF\ P 6 = \EFGH } 

Note the edge repetitions encountered in drawing the polygons. For example, polygons P 2 and R, share the 
edge DE. 


9.3 Show that the nth-degree B-spline basis functions B in (x) satisfy 

B in (x) = 0 if x < /,• or x > f (+n+1 


SOLUTION 

Since the B-spline basis functions R, „(x) are defined recursively in terms of the lower-order B-splines 
and 5, +1 „_|(x) of degree n — 1, we shall indicate only the first step of a general induction argument. 
Therefore, we illustrate what happens for the first-degree B-spline B i { (jc). 

Now suppose that .v < /,. Then also x < t l+i , so at x the zero-degree B-splines have the values B, 0 (x) = 0 
and R, + i.o(x) = 0. From these values we find, in turn, that B i X (x) = 0. 

Now suppose that the knot set is nonrepeating. Let x = Then 2? / 0 (f,) = 1, but B i+X u (tj) = 0. So 

BM = ~~~~ ( i ) + • f| ~~ ~ f '- • ( 0) = 0 

9+1 — 9 9+2 — 9+1 

Thus Bj ](x) = 0 if x < Similar arguments show that B iA (x) = 0 if x > t, +1 . 


9.4 Let R o (0, 0), Rj(l,2), R 2 (2, 1), R 3 (3,— 1), R 4 (4, 10), and R 5 (5,5) be given data points. If 
interpolation based on cubic B-splines is used to find a curve interpolating these data points, 
find a knot set / 0 , . . . , +, that can be used to define the cubic B-splines. 

SOLUTION 

The knot set can be chosen according to one of two schemes. With m = 3 and n = 5: 

1 . Choose 

<o = t, =t 2 =f 3 = -1 (<x 0 ) and t 6 = f 7 = = t 9 = 6 (>x„) 
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The remaining knots are chosen according to 

f i+m+l = ' 

So 


i = 0, — m — 1 


1+2+3 „ 2+3+4 , 

'*—5— “ 2 ~ = 3 

2. An alternative scheme for cubic spLines is 

h = h — ti = h = 1 = t-j = fg = tg — 6 

and the remaining knots are chosen according to 


';+4 ■ 


:jc i+2 , ( = 0, — 4 


So 


U = 2 t s = 3 

The agreement between knot sets chosen according to these two schemes is a result of the uniform 
spacing of the data points along the x axis. 


9.5 Write the equations that can be used to find an interpolating cubic spline curve to fit the data in 
Prob. 9.4 using cubic B-spline basis functions. 

SOLUTION 

With m = 3 and n = 5, the interpolating spline can be written in tenns of cubic B-splines as 

7 

S}(x) = X>A, 3 W 
/= o 

The interpolation equations for the data points (Xj,Vj),j = 0 5 are 

>) = S 3 (xj), 7 = 0 5 

With the knot set chosen in Prob. 9.4, the intervals [tj, ; J+4 ] defined by the knots tj. j = 0 9 are 


j 

<j 

tj+4 

0 

-1 

2 

I 

-1 

3 

2 

-1 

6 

3 

-1 

6 

4 

2 

6 

5 

3 

6 


Because B i 3 (x) is nonzero only for t t < x < t i+A , the interpolation equations become 

yj = Sy (xj) 

0 = a 0 B n3 ( 0) + 3 (0) + + a 3^3,3(0) 

2 = a u B 0 3 ( 1) + O] B\ 3 (1 ) + a 2 B 23 (\ ) + 03 ^ 3 , 3 ( 1 ) 

1 = a x B x 3 (2) + 02 ^ 2 , 3 ( 2 ) + a jB 33 (2) 

— 1 = a 2 B 2i (3) + a }B 33 (3) + a 4 S 4 3(3) 

10 = a 2 B 23 (4) + a 3 B 3 } (4) + fl 4 S 4 } (4) + a 3 B 3 3 (4) 

5 = a 2 B 23 (5) + a 3 B 3 3 (5) + a 4 B 43 ( 5) + a 3 B 3 3 ( 5) 
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The remaining two equations can be chosen to satisfy prescribed boundary conditions at .v 0 = 0 and x 5 = 5. 


9.6 Show that the knot set used in constructing the Bezier B-spline approximation to a guiding polyline 
guarantees that the endpoints of the spline coincide with the endpoints of the guiding polyline. 

SOLUTION 

For an m-degree Bezier-B-spline approximation, the knot set used is 

0 , 0 , 1 . 2 , . . . , n — mi , n — m + 1 . n — m 4 - 1 n — m + 1 

m+l m+1 

Let Pq(x 0 , y 0 , -o) be the first control point of the guiding polyline and P N (x ir y n , z n ) the last. Now 
x (/) = J2" =0 j with similar expressions for it?) and z(l). We wish to show that 

Jc(0) =a.- 0 ,.v( 0) =y 0 , r(0) = j 0 and x(n - 1) = x„,y(n - 1) =y„,z(n ~ 1) = z* 

Let us restrict ourselves to m = 2 (quadratic) B-splines. Then the knot set is 

0 , 0 . 0 , 1 , 2 , 3 , . . . , n — 2 , n — l,n — 1 , n — 1 

to? t| , t 2 , t 3 , / 3 , .... t n 3_2» t, ( ^_3, t, t +4 

Since B i2 (x) is nonzero only over the interval t, <x< t l+ 3 , it follows that 5, 2 (0) is nonzero only if i = 0, 1 , 
and 2. So then 

*(0) = Y, x i B i ?(°) = *o*u 2 (°) + X \ B \ 7(0) +x-,B n ,(0) 

i=0 

To calculate 5 ( , 2 , using the definition and the convention that Jj = 0, we obtain 

*0,2(0) = *1,1(0) and *i ,i(0) = fl 2 .o(0) and S 2 .o(0) = 1 

So B o2 (0) = 1 (compare with Prob. 9.3). 

To calculate B t 2 (0), using the definition 

2 2 

fi 12 (0) = - S 21 (0) and fi 2 ,(())=- — -Z? 30 (0) and 5 3 o (0) = 0 since 0 < f 3 = 1 

So we have B I 2 (0) = 0. In a similar manner, we find B 22 (0) = 0. Thus, x(0) = ,v 0 and the same for the y and z 
coordinates. 

Similar calculations show that x(n — 1) = x„, y(ri — 1) = y„, and z(n — 1) = z„. 


9.7 Find the linear Coons surface patch that interpolates the curves of Fig. 9-26. 



Fig. 9-26 
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SOLUTION 

The four bounding curves can be described parametrically as follows (see Fig. 9-26): 

1 . P(s, 0) = (cos ns, 0, sin ns) 0 < s < 1 

2. P(l,f) = (-1.0,0) 0 < / < 1 

3 P(s, 1 ) = (cos ns, sin ns, 0) 0 < s < 1 

4. P(0.r) = (1.0.0) 0 < / < 1 

Note that curves 2 and 4 are constant curves, that is, points. 

The linear Coons surface interpolating these curves can be written as 

Q(s. t) = P(s, 0)(l -t) + P(s. l)/4-P(0. t)(l — s) + P(\. t)s — P(0, 0)(1 -5)(1 -/) 
-P( 0, 1)(1 — s)t — />( 1 , 0)5( 1 — /) — p( 1 , 1)5/ 

In tenns of coordinates: 

■V(5, /) = (cos 7T5)( 1 — /) 4- (cos 7T s)l + ( 1 )( 1 — s) + (— 1 )s 

-0X1 -5)/-(-l)5(l -/)-(-l)5/ 


or 


Now 


x(s, /) = cos ns 


y(5, /) = (0)( 1 — /) 4- (sin ns)l + (0)( I — 5 ) + (0)5 — (0)( I — s)( 1 — /) — (0)( 1 — s)l — (0).r( 1 — /) — (0).s7 


or 


y(s, t) — t sin ra 


Finally 

z(5, /) = (sin ns)( 1 - /) 4- (0)/ + (0)( 1 - s) + (0)5 - (0)( 1 - s)( 1 - /) - (0)( 1 - s)t - (0)5( 1 - /) - (0)5/ 


or 


z(s, /) = (!— /) sin ns 


The linear Coons surface is 

Q(s, t) = [cos 7t5, / sin ns, ( 1 — /) sin 715 ] 


0 < 5 , / < 1 


9.8 


Find the lofting surface defined by linear blending between the cross-section curves in Fig. 9-27. 

SOLUTION 


The curves in Fig. 9-27 are circles whose equations can be defined parametrically as 
7 * 1 ( 5 ) = (cos 2ns, sin 27t5, 0) 0<5<1 


and 


The lotting surface is then 


P 2 (s) = (2 cos 2715, 2 sin 2ns, 4) 0 < 5 < 1 


Q(s, /) = (!— OP] ( 5 ) + tP 2 (s) 


In tenns of coordinates, we find that 

x(5, /) = ( 1 — /)(cos 2ns) 4-/(2 cos 2715) = (1 4- /) cos 27t5 
v(5, /) = ( 1 — /)(sin 2?t5) 4- 1(2 sin 27w) = ( 1 4- /) sin 2ns 
z(5,/) = (l -/)(0) 4-/(4) = 4/ 
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Fig. 9-27 


Thus 


Q(s. /) = [(! T- 1) cos 2ns, (1 + 1) sin 2 ns, 4 1 ] 0 < s. f < 1 


Supplementary Problems 


9.9 Represent a cube using an explicit edge listing. 

9.10 Find an explicit representation for linear (degree 1) B-splines in the case of unifonnly spaced knots, i.e., 
',+i - /, = L. 


9.1 1 For the knot set f, = l, t 2 = 2, . . . , = \, calculate B, 3 (5.5). 



CHAPTER 10 



Hidden Surfaces 


Opaque objects that are closer to the eye and in the line of sight of other objects will block those objects or 
portions of those objects from view. In fact, some surfaces of these opaque objects themselves are not 
visible because they are eclipsed by the objects’ visible parts. The surfaces that are blocked or hidden from 
view must be “removed” in order to construct a realistic view of the 3D scene (see Fig. 1 -3 where only 
three of the six faces of the cube are shown). The identification and removal of these surfaces is called the 
hidden-surface problem. The solution involves the determination of the closest visible surface along each 
projection line (Sec. 10.1). 

There are many different hidden-surface algorithms. Each can be characterized as either an image- 
space method , in which the pixel grid is used to guide the computational activities that determine visibility 
at the pixel level (Secs. 10.2, 10.5, and 10.6), or an object-space method, in which surface visibility is 
determined using continuous models in the object space (or its transformation) without involving pixel- 
based operations (Secs. 10.3 and 10.4). 

Notice that the hidden-surface problem has ties to the calculation of shadows. If we place a light 
source, such as a bulb, at the viewpoint, all surfaces that are visible from the viewpoint are lit directly by 
the light source and all surfaces that are hidden from the viewpoint are in the shadow of some opaque 
objects blocking the light. 


10.1 DEPTH COMPARISONS 

We assume that all coordinates (.v, y, z) are described in the normalized viewing coordinate system 

(Chap. 8). 

Any hidden-surface algorithm must determine which edges and surfaces are visible either from the 
center of projection for perspective projections or along the direction of projection for parallel projections. 

The question of visibility reduces to this; given two points P, (x, , iq , z, ) and P 2 (x 2 , y 2 , z 2 ), does either 
point obscure the other? This is answered in two steps: 

1. Are P, and P 2 on the same projection line? 

2. If not, neither point obscures the other. If so, a depth comparison tells us which point is in front of 

the other. 

For an orthographic parallel projection onto the xv plane, P l and P 2 are on the same projector if 
Xj = x 2 and — y 2 . In this case, depth comparison reduces to comparing z x and z 2 . If z l < z 2 , then P, 
obscures P 2 [see Fig. 10-l(u)]. 

For a perspective projection [see Fig. 10-1(6)], the calculations are more complex (Prob. 10.1). 
However, this complication can be avoided by transforming all three-dimensional objects so that parallel 
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projection of the transformed object is equivalent to a perspective projection of the original object ( see Fig. 
10-2). This is done with the use of the perspective to parallel transform T p (Prob. 10.2). 

If the original object lies in the normalized perspective view volume (Chap. 8), the normalized 
perspective to parallel transform 


NT p = 


(\ 0 
0 5 
0 0 

VO 0 


1 

2 

1 

2 


1 



1 


0 \ 
0 

~ z f 
~ z f 

0 / 



I 



Parallel projection of 
this configuration is 
[ABCDEFG] 


Perspective projection of 
this configuration is 
[ABCDEFG] 
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( where z,- is the location of the front clipping plane of the normalized perspective view volume) transforms 
the normalized perspective view volume into the unit cube bounded by 0 < x < 1, 0 < y < 1, 0 < z < 1 
(Prob. 10.3). We call this cube the normalized display space. A critical fact is that the normalized 
perspective to parallel transform preserves lines, planes, and depth relationships. 

If our display device has display coordinates H x V, application of the scaling matrix 


(H 

0 

0 

0\ 

0 

V 

0 

0 

0 

0 

1 

0 

^0 

0 

0 

1/ 


transforms the normalized display space 0 < x < 1, 0 < y < 1, 0 < z < 1 onto the region 0 <x < H, 
0 < y < V, 0 < z < 1. We cal this region the display space. The display transform DT p \ 

DT p = Sn,y,\ ■ NT p 

transforms the normalized perspective view volume onto the display space. 

Clipping must be done against the normalized perspective view volume prior to applying the transform 
NT p . An alternative to this is to combine NT p with the normalizing transformation (Chap. 8), forming 
the single transformation NT’ p — NT p ■ N per . Then clipping is done in homogeneous coordinate space. This 
method for performing clipping is not covered in this book. 

We now describe several algorithms for removing hidden surfaces from scenes containing objects 
defined with planar (i.e., flat), polygonal faces. We assume that the displays transform DT p has been 
applied (if a perspective projection is being used), so that we always deal with parallel projections in 
display space. 


10.2 Z-BUFFER ALGORITHM 

We say that a point in display space is “seen” from pixel (x, y) if the projection of the point is scan- 
converted to this pixel (Chap. 3). The Z-buffer algorithm essentially keeps track of the smallest z 
coordinate (also called the depth value ) of those points which are seen from pixel (.v, y). These Z values are 
stored in what is called the Z buffer. 

Let Z bui (x, y) denote the current depth value that is stored in the Z buffer at pixel (x,y). We work with 
the (already) projected polygons P of the scene to be rendered. 

The Z-buffer algorithm consists of the following steps. 

1 . Initialize the screen to a background color. Initialize the Z buffer to the depth of the back clipping 
plane. That is, set 

Z buf (x, y) = Z back , for every pixel (x, y) 

2. Scan-convert each (projected) polygon P in the scene (Chap. 3) and during this scan-conversion 
process, for each pixel (x, y) that lies inside the polygon: 

{a) Calculate Z(x,y), the depth of the polygon at pixel (x,y). 

(h) If Z(x,y) < Z buf (x,y), set Z buf (x, y) = Z(x,v) and set tire pixel value at (x,y) to the color of 
the polygon P at (x,y). In Fig. 10-3, points P, and P 2 are both scan-converted to pixel (x,y); 
however, since Z[ < z 2 , Pi will obscure P 2 and the P t z value, z h will be stored in the Z 
buffer. 

Although the Z-buffer algorithm requires Z-buffer memory storage proportional to the number of 
pixels on the screen, it does not require additional memory for storing all the objects comprising the scene. 
In fact, since the algorithm processes polygons one at a time, the total number of objects in a scene can be 
arbitrarily large. 
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10.3 BACK-FACE REMOVAL 

Object surfaces that are orientated away from the viewer are called back-faces. The back-faces of an 
opaque polyhedron are completely blocked by the polyhedron itself and hidden from view. We can 
therefore identify and remove these back-faces based solely on their orientation without further processing 
(projection and scan-conversion) and without regard to other surfaces and objects in the scene. 

Let N = (A, B, C) be the normal vector of a planar polygonal face, with N pointing in the direction the 
polygon is facing. Since the direction of viewing is the direction of the positive z axis (see Fig. 10-3), the 
polygon is facing away from the viewer when C > 0 (the angle between N and the z axis is less than 90°). 
The polygon is also classified as a back-face when C — 0, since in this case it is parallel to the line of sight 
and its projection is either hidden or overlapped by the edge(s) of some visible polygon(s). 

Although this method identifies and removes back-faces quickly it does not handle polygons that face 
the viewer but are hidden (partially or completely) behind other surfaces. It can be used as a preprocessing 
step for other algorithms. 


10.4 THE PAINTER S ALGORITHM 

Also called the depth sort or priority algorithm , the painter’s algorithm processes polygons as if they 
were being painted onto the view plane in the order of their distance from the viewer. More distance 
polygons are painted first. Nearer polygons are painted on or over more distance polygons, partially or 
totally obscuring them from view. The key to implementing this concept is to find a priority ordering of the 
polygons in order to determine which polygons are to be painted (i.e., scan-converted) first. 

Any attempt at a priority ordering based on depth sorting alone results in ambiguities that must be 
resolved in order to correctly assign priorities. For example, when two polygons overlap, how do we decide 
which one obscures the other? (See Fig. 10-4.) 
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Fig. 10-4 Projection of opaque polygons. 


Assigning Priorities 

We assign priorities to polygons by determining if a given polygon P obscures other polygons. If the 
answer is no, then P should be painted first. Hence the key test is to determine whether polygon P does not 
obscure polygon 0. 

The z extent of a polygon is the region between the planes z = z mm and z = z inax (Fig. 10-5). Here, z min 
is the smallest of the z coordinates of all the polygon’s vertices, and z^ is the largest. 

Similar definitions hold for the .r and v extents of a polygon. The intersection of the x,y, and z extents 
is called the extent, or bounding box, of the polygon. 



Testing Whether P Obscures Q 

Polygon P does not obscure polygon Q if any one of the following tests, applied in sequential order, is 

true. 

Test 0: the z extents o fP and Q do not overlap and z a of Q is smaller than z P . of P. Refer to Fig. 
10 - 6 . 

Test 1 : the y extents of P and Q do not overlap. Refer to Fig. 1 0-7. 

Test 2: the x extents of P and Q do not overlap. 
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Fig. 10-7 


Test 3: all the vertices of P lie on that side of the plane containing Q which is farthest from the 
viewpoint. Refer to Fig. 10-8. 

Test 4: all the vertices of Q lie on that side of the plane containing P which is closest to the 
viewpoint. Refer to Fig. 10-9. 

Test 5: the projections of the polygons P and Q onto the view plane do not overlap. This is checked 
by comparing each edge of one polygon against each edge of the other polygon to search for 
intersections. 


The Algorithm 

1 . Sort all polygons into a polygon list according to z niax (the largest z coordinate of each polygon’s 
vertices). Starting from the end of the List, assign priorities for each polygon P. in order, as 
described in steps 2 and 3 (below). 
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2. Find all polygons Q (preceding P) in the polygon list whose 2 extents overlap that of P (test 0). 

3. For each Q, perform tests 1 through 5 until true. 

(a) If every Q passes, scan-convert polygon P. 

(b) If false for some Q, swap P and Q on the list. Tag Q as swapped. If 0 has already been 
tagged, use the plane containing polygon P to divide polygon Q into two polygons. Q x and 
Qi [see Fig. 10-4(6)]. The polygon-clipping techniques described in Chap. 5 can be used to 
perform the division. Remove 0 from the list and place Q x and 0 2 on the list, in sorted 
order. 

Sometimes the polygons are subdivided into triangles before processing, thus reducing the computa- 
tional effort for polygon subdivision in step 3. 


10.5 SCAN-LINE ALGORITHM 

A scan-line algorithm consists essentially of two nested loops, an x-scan loop nested within a y-scan 
loop. 
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y Scan 

For each v value, say, y — a, intersect the polygons to be rendered with the scan plane y — ot. This 
scan plane is parallel to the xz plane, and the resulting intersections are line segments in this plane (see 
Fig. 10-10). 



Fig. 10-10 


A' Scan 

1 . For each x value, say, .v = //, intersect the line segments found above with the x-scan line x — ji 
lying on the y-scan plane. This intersection results in a set of points that lies on the x-scan line. 

2. Sort these points with respect to their z coordinates. The point (x, v, z) with the smallest z value is 
visible, and the color of the polygon containing this point is the color set at the pixel 
corresponding to this point. 

In order to reduce the amount of calculation in each scan-line loop, we try to take advantage of 
relationships and dependencies, called coherences, between different elements that comprise a scene. 


Types of Coherence 

1. Scan-line coherence. If a pixel on a scan line lies w'ithin a polygon, pixels near it will most likely 
lie within the polygon. 

2. Edge coherence. If an edge of a polygon intersects a given scan line, it will mot likely intersect 
scan lines near the given one. 
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3. Area coherence. A small area of an image will most likely lie within a single polygon. 

4. Spatial coherence. Certain properties of an object can be determined by examining the extent of 
the object, that is, a geometric figure which circumscribes the given object. Usually the extent is a 
rectangle or rectangle solid (also called a bounding box). 

Scan-line coherence and edge coherence are both used to advantage in scan-converting polygons 
(Chap. 3). 

Spatial coherence is often used as a preprocessing step. For example, when determining whether 
polygons intersect, we can eliminate those polygons that don’t intersect by finding the rectangular extent of 
each polygon and checking whether the extents intersect — a much simpler problem (see Fig. 10-1 1). [Note: 
In Fig. 10-11 objects A and B do not intersect; however, objects A and C, and B and C, do intersect. In 
preprocessing, coiner points would be compared to determine whether there is an intersection. For 
example, the edge of object A is at coordinate P$ — (6, 4) and the edge of object B is at coordinate 
P 4 - (7, 3).] Of course, even if the extents intersect, this does not guarantee that the polygons intersect. 
See Fig. 10-12 and note that the extents of A' and B’ overlap even though the polygons do not. 



Fig. 10-1 1 


Coherences can simplify calculations by making them incremental, as opposed to absolute. This is 
illustrated in Prob. 10.13. 


A Scan-line Algorithm 

In the following algorithm, scan line and edge coherence are used to enhance the processing done in 
the y-scan loop as follows. Since the y-scan loop constructs a list of potentially visible line segments, 
instead of reconstructing this list each time the y-scan fine changes (absolute calculation), we keep the list 
and update it according to how it has changed (incremental calculation). This processing is facilitated by 


206 


HIDDEN SURFACES 


[CHAP. 10 



the use of what is called the edge list, and its efficient construction and maintenance is at the heart of the 
algorithm (see Chap. 3, Sec. 3.7, under “A Scan-line Algorithm”). 

The following data structures are created: 

1 . The edge list — contains all nonhorizontal edges (horizontal edges are automatically displayed) of 
the projections of the polygons in the scene. The edges are sorted by the edge’s smaller y 
coordinate (y mm ). Each edge entry in the edge list also contains: 

(a) The x coordinate of the end of the edge with the smaller y coordinate. 

(b) The y coordinate of the edge’s other end (y max ). 

(c) The increment Ax = 1 jm. 

(d) A pointer indicating the polygon to which the edge belongs. 

2. The polygon list — for each polygon, contains 

(a) The equation of the plane within which the polygon lies- used for depth determination, i.e., 
to find the z value at pixel (x, v). 

( b ) An IN/OUT flag, initialized to OUT (this flag is set depending on whether a given scan line 
is in or out of the polygon). 

(c) Color information for the polygon. 

Tire algorithm proceeds as follows. 

I. Initialization. 

{a) Initialize each screen pixel to a background color. 

( b ) Set y to the smallest y min value in the edge fist. 

Repeat steps II and 111 (below) until no further processing can be perfonned. 

II. y-scan loop. Activate edges whose y mm is equal to y. Sort active edges in order of increasing x. 

III. x-scan loop. Process, from left to right, each active edge as follows: 
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(«) Invert the IN/OUT flag of the polygon in the polygon list which contains the edge. Count 
the number of active polygons whose IN/OUT flag is set to IN. If this number is l , only one 
polygon is visible. All pixel values from this edge and up to the next edge are set to the color 
of the polygon. If this number is greater than 1, determine the visible polygon by the 
smallest z value of each polygon at the pixel under consideration. These z values are found 
from the equation of the plane containing the polygon. The pixels from this edge and up to 
the next edge are set to the color of this polygon, unless the polygon becomes obscured by 
another before the next edge is reached, in which case we set the remaining pixels to the 
color of the obscuring polygon. If this number is 0, pixels from this edge and up to the next 
one are left unchanged. 

(b) When the last active edge is processed, we then proceed as follows: 

1. Remove those edges for which the value of y niax equals the present scan-line value y. If 
no edges remain, the algorithm has finished. 

2. For each remaining active edge, in order, replace x by x-\-\/m. This is the edge 
intersection with the next scan line y + 1 (see Prob. 10.13). 

3. Increment y to y + 1, the next scan line, and repeat step II. 

10.6 SUBDIVISION ALGORITHM 

The subdivision algorithm is a recursive procedure based on a two-step strategy that first decides 
which projected polygons overlap a given area A on the screen and are therefore potentially visible in that 
area. Second, in each area these polygons are further tested to detennine which ones wall be visible within 
this area and should therefore be displayed. If a visibility decision cannot be made, this screen area, usually 
a rectangular window, is further subdivided either until a visibility decision can be made, or until the screen 
area is a single pixel. 

Starting with the full screen as the initial area, the algorithm divides an area at each stage into four 
smaller areas, thereby generating a quad tree (see Fig. 10-13). 


A 


(a) Initial area. 



(b) First subdivision. 
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(c) Second subdivision. ( d ) Third subdivision. 


Fig. 10-13 
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The processing exploits area coherence by classifying polygons P with respect to a given screen area A 
into the following categories: (1) surrounding polygon — polygon that completely contains the area [Fig. 
10-14(n)], (2) interesecting polygon — polygon that intersects the area [Fig. 1 0-14(/>)], (3) contained 
polygon — polygon that is completely contained within the area [Fig. 10- 14(c)], and (4) disjoint polygon 
polygon that is disjoint from the area [Fig. 1 0- 1 4(c/)]. 



(c) P is contained in A. (</) P is disjoint from A. 

Fig. 10-14 


The classification of the polygons within a picture is the main computational expense of the algorithm 
and is analogous to the clipping algorithms discussed in Chap. 5. With the use of one of these clipping 
algorithms, a polygon in category 2 (intersecting polygon) can be clipped into a contained polygon and a 
disjoint polygon (see Fig. 10-15). Therefore, we could proceed as if category 2 were eliminated. 



Fig. 10-15 
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For a given screen area, we keep a potentially visible polygons list (PVPL), those in categories 1, 2, 
and 3. (Disjoint polygons are clearly not visible.) Also, note that on subdivision of a screen area, 
surrounding and disjoint polygons remain surrounding and disjoint polygons of the newly formed areas. 
Therefore, only contained and intersecting polygons need to be reclassified. 


Removing Polygons Hidden by a Surrounding Polygon 

The key to efficient visibility computation lies in the fact that a polygon is not visible if it is in back of 
a surrounding polygon. Therefore, it can be removed from the PVPL. To facilitate processing, this list is 
sorted by z min , the smallest z coordinate of the polygon within this area. In addition, for each surrounding 
polygon S, we also record its largest z coordinate, z, . 

x max 

If, for a polygon P on the list, z r > z s (for a surrounding polygon S), then P is hidden by S and 
thus is not visible. In addition, all other polygons after P ou the list will also be hidden by S, so we can 
remove these polygons from the PVPL. 


Subdivision Algorithm 

1 . Initialize the area to be the whole screen. 

2. Create a PVPL with respect to an area, sorted on z mjn (the smallest z coordinate of the polygon 
within the area). Place the polygons in their appropriate categories. Remove polygons hidden by a 
surrounding polygon and remove disjoint polygons. 

3. Perform the visibility decision tests: 

(a) If the list is empty, set all pixels to the background color. 

(b) If there is exactly one polygon in the list and it is a classified as intersecting (category 2) or 
contained (category 3), color (scan-convert) the polygon, and color the remaining area to the 
background color. 

(c) If there is exactly one polygon on the list and it is a surrounding one, color the area the color 
of the surrounding polygon. 

(d) If the area is the pixel (x, y), and neither a, b, nor c applies, compute the z coordinate z(x, y) 
at pixel (x, v) of all polygons on the PVPL. The pixel is then set to the color of the polygon 
with the smallest z coordinate. 

4. If none of the above cases has occurred, subdivide the screen area into fourths. For each area, go to 
step 2. 


10.7 HIDDEN-LINE ELIMINATION 

Although there are special-purpose hidden-line algorithms, each of the above algorithms can be 
modified to eliminate hidden lines or edges. Tins is especially useful for wireframe polygonal models 
where the polygons are unfilled. The idea is to use a color rule which fills all the polygons with the 
background color — say, black — and the edges and lines a different color — say, white. The use of a hidden- 
surlace algorithm now becomes a hidden-line algorithm. 


10.8 THE RENDERING OF MATHEMATICAL SURFACES 

In plotting a mathematical surface described by an equation z — F(x, y), where .v min < .t < x max and 
Train — )’ — Tmax’ we could use any of the hidden-surface algorithms so far described. However, these 
general algorithms are inefficient when compared to specialized algorithms that take advantage of the 
structure of this type of surface. 

The mathematical surface is rendered as a wireframe model by drawing both the x-constant curves 
z — F( const, >j and the y-constant curves z — F(x, const) (see Fig. 10-16). Each such curve is rendered as 
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a polyline, where the illusion of smoothness is achieved by using a fine resolution (i.e., short line 
segments) in drawing the polyline (Chap. 3). 



Fig. 10-16 


Choose an M x N plotting resolution 

*min < *1 < X 2 < ■ ■ < x M < and Jnun < V, < ,v 2 < • • • < y* < tw 

The corresponding z values are Zy — F{x t ,yj). An x-constant polyline, say x — x p has vertices 

P l (x j ,y\),...,P N (Xj,y N ) 

Siinilarly, the y = y k polyline has vertices 

Si (^l ■> .v*), .... £?,w (■%,>’*) 

Choosing a view plane and a center of projection or viewpoint C(a, b, c), we create a perspective view of 
the surface onto this view plane by using the transformations developed in Chap. 7. So a point 
[x,y, F(x, >)] on the surface projects to a point (p, q) in view plane coordinates. By applying an appropriate 
2D viewing transformation (Chap. 5), we can suppose that p and q line within the horizontal and vertical 
plotting dimensions of the plotting device, say, H x V pixels. 


The Perimeter Method for Rendering the Surface 

Each plotted x and v constant polyline outlines a polygonal region on the plotting screen (Fig. 10-17). 

The algorithm is based on the following observations: (1 ) ordering — the x- and y-constant curves (i.e., 
polylines) are drawn in order starting with the one closest to the viewpoint and (2) visibility — we draw only 
that part of the polyline that is outside the perimeter of all previously drawn regions (Fig. 10-18). One 
implementation of the visibility condition uses a min max array A, of length H (that of the plotting device), 
which contains, at each horizontal pixel position /, the maximum (and/or minimum) vertical pixel value 



i 



Fig. 10-17 


10 


Fig. 10-18 


10 
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drawl thus far at /; that is A(i) = (vertical pixel values drawn so far at /) (Fig. 10-19). Selection of the 
max results in a drawing of the top of the surface. The min is used to render the bottom of the surface, and 
the max and min yields both top and bottom. 


(0) (1) (2) 



I 10 

A: 0000000000 


Fig. 10-19 
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A: 5443464322 



I 10 

A: 7565466544 


Updating min-max array A (using maximum values only). 


The Visibility Test 

Suppose that ( //, q') is the pixel that corresponds to the point (p, q ). Then this pixel is visible if either 

q > A(p) or q < A(p) 


where A is the min-max array. 

The visibility criteria for a line segment are: (1) the line segment is visible if both its endpoints are 
visible; (2) the line segment is invisible if both its endpoints are not visible; and (3) if only one endpoint is 
visible, the min-max array is tested to find the visible part of the line. 

Drawing the x- or y-constant polylines thus consists of testing for the visibility of each line segment 
and updating the min-max array as necessary. Since a line segment will, in general, span several horizontal 
pixel positions (see segment in Fig. 10-19), the computation of A(i) for these intermediate pixels is 
found by using the slope of the line segment or by using Bresenham’s method described in Chap. 3. 


The Wright Algorithm for Rendering Mathematical Surfaces 

The drawing of the surface z = F(x,y) proceeds as follows. 

1 . To perfonn initialization, determine whether the viewpoint is closer to the x or the y axis. Suppose 
that it is closer to the x axis. We next locate the x-constant curve that is closest to the viewpoint at 
x — 1. 

(a) Initialize the min-max array to some base value, say, zero. 

(b) Start with the x-constant curve found above. 

2. Repeat the following steps using the visibility test for drawing line segments and updating the 
min max array each time a line segment is drawn: 

(a) Draw the x-constant polyline. 

(/>) Draw those parts of each y-constant polyline that lie between the previously drawn x- 
constant polyline and the next one to be drawn. 

(c) Proceed, in the direction of increasing x, to the next x-constant polyline. 
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Solved Problems 


10.1 Given points 1,2,0), P 2 (3,6, 20), anc ) /> 3 ( 2,4,6) and a viewpoint C(0, 0,-10), determine 
which points obscure the others when viewed from C. 

SOLUTION 

The line joining the viewpoint C(0, 0,-10) and point P t ( 1 . 2, 0) is (App. 2) 

x = t v = 2t z = — 10 + 10/ 

To determine whether P 2 (3, 6. 20) lies on this line, we see that x = 3 when / = 3, and then at f = 3, x = 3, 
y = 6, and z = 20. So P 2 lies on the projection line through C and P\. 

Next we determine which point is in front with respect to C. Now C occurs on the line at / = 0, P, occurs 
at / = 1 , and P 2 occurs at / = 3. Thus comapring / values, P, is in front of P 2 with respect to C: that is, P, 
obscures P 2 . 

We now' determine whether P,(2, 4, 6) is on the line . Now x = 2 when / = 2 and then y = 4, and z = 1 0. 
Thus Pj( 2, 4, 6) is not on this projection line and so it nether obscures nor is obscured by P t and P 2 . 


10.2 Construct the perspective to parallel transform T p which produces an object whose parallel 
projection onto the xy plane yields the same image as the perspective projection of the original 
object onto the normalized view' plane z = d z /(c' z + b) (Chap. 8, Prob. 8.6) with respect to the 
origin as the center of projection. 

SOLUTION 


The perspective projection onto the plane z = cL/Ui 4- b) with respect to the origin is (Chap. 7, Prob. 

7.4): 

/z„ 0 0 0\ 

0 z t , 0 0 

0 0 z e 0 

\0 0 10 / 

where z„ = c'./(c' : 4- b). The perspective projection onto the view plane of a point P(x, y, z) is the point 


Per = 




Define the perspective to parallel transform T p to be 

(z v 0 0 0 \ 

0 z„ 0 0 


T = 

p 


0 0 




>-*/ I"-/ 

\0 0 1 0 / 


(where z = z f is the location of the nonnalized front clipping plane; see Chap. 8, Prob. 8.6). 

Now, applying the perspective to parallel transform T p to the point P(x, v, z), we produce the point 



The parallel projection of Q' onto the xy plane produces the point 



So Q and P produce the same projective image. Furthermore, T p transforms the normalized perspective view 
volume bounded by x = z, x = — z, y = z, y = — z, z = z f , and z = 1 to the rectangular volume bounded by 
x = z„, a- = -z,„ y = z„, y = -z„, z = 0, and z = 1 . 
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10.3 Show that the normalized perspective to parallel transform NT p preserves the relationships of the 
original perspective transformation while transforming the normalized perspective view volume 
into the unit cube. 


SOLUTION 

From Prob. 10.2, the perspective to parallel transform T p transforms a point P(x,y,z) to a point 


'(“¥«) 


The image under parallel projection of this point onto the xy plane is 

*(?■?■•) 

The factor z„ can be set equal to 1 without changing the relation between points Q and Q'. 


The matrix that transforms P(x,y, z) to the point Q 




is then 


T = 

l p 


(\ 0 
0 1 
0 0 

Vo o 


0 \ 
0 


\-z f \-z f 


1 


0 / 


In addition, this matrix transforms the normalized perspective view volume to the rectangular view 
volume bounded by x = 1, x = — 1 , y = 1, y = — 1 , z = 0, and z — 1. 

We next translate this view volume so that the corner point (—1, —1,0) translates to the origin. The 
translation matrix that does this is 

(\ 0 0 1\ 

0 10 1 
0 0 10 
\0 0 0 1 / 

The new region is a volume bounded by x = 0, x = 2, y = 0, y = 2, z = 0, and z = I . 

Finally, we scale in the x and y direction by a factor | so that the final view volume is the unit cube: x = 0, 
x = 1 , y = 0, y = 1 , z = 0, and z = 1 . The scaling matrix is 


T OJ.o) - 


h/2,1/2,1 — 


(\ 0 0 0\ 
0^00 
0 0 10 


\0 0 0 1 / 

The final nonnalized perspective to parallel transfonn is 

fk o 


NT p — Sj/ 2 , i/ 2 ,i ’ T( 1,1,0) ■ T p — 


0 2 
0 0 
Vo o 


0 \ 

0 

~ : r 


• ~ 2 f >-*/ 

1 0 ) 


1 0.4 Why are hidden-surface algorithms needed? 

SOLUTION 

Hidden-surface algorithms are needed to determine which objects and surface will obscure those objects 
and surfaces that are in back of them, thus rendering a more realistic image. 
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10.5 What two steps are required to determine whether any given points obscures another 

point P 2 {x 2 ,y 2 , z 2 V (See Fig. 10-1.) 

SOLUTION 

It must be determined (1) whether the two points lie on the same projection line and (2) if they do, which 
point is in front of the other. 

10.6 Why is it easier to locate hidden surfaces when parallel projection is used? 

SOLUTION 

There are no vanishing points in parallel projection. As a result, any point P(a. b, z) will line on the same 
projector as any other point having the same x and y coordinates (a, b). Thus only the z component must be 
compared to determine which point is closest to the viewer. 

10.7 How does the Z-buffer algorithm determine which surfaces are hidden? 

SOLUTION 

The Z-bufFer algorithm sets up a two-dimensional array which is like the frame buffer; however the Z 
buffer stores the depth value at each pixel rather than the color, which is stored in the frame buffer. By setting 
the initial values of the Z buffer to some large number, usually the distance of back clipping plane, the 
problem of detennining which surfaces are closer is reduced to simply comparing the present depth values 
stored in the Z buffer at pixel (,v, vj with the newly calculated depth value at pixel (x,y). If this new value is 
less than the present Z-buffer value (i.e., closer along the line of sight), this value replaces the present value 
and the pixel color is changed to the color of the new surface. 

10.8 Using a 2 x 2 pixel display, show how the Z-buffer algorithm would determine the color of each 
pixel for the given objects A and B in Fig. 10-20. 



Display space 


Fig. 10-20 
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SOLUTION 

The display space for the 2 x 2 pixel display is the region 0 < x < 2, 0 < y < 2, and 0 < z < 1 . In Fig. 
10-20, A is the line with display space coordinates A y (j, 5, 0) and A 2 (2. 2, 0); line A is on the display screen in 
front of square B. B is the square with display space coordinates B ] (0,0,1), £,( 2,0, f), S 3 (2, 2, |), and 
B 4 (0, 2, i). The displayed image of A (after projection and scan conversion) would appear on a 2 x 2 pixel 
display as 


V 

a 

a 

y 


0 1 

where a is the color of A and y is the background color. We have used the fact (Chap. 3, Sec. 3.1) that a point 
(x,y) scan-converts to the pixel [Floor! a), Floor( v)]. (We assume for consistency that 2 s 1.99 — ) The 
displayed image of B is 


1 

0 



where b is the color of B. We apply the Z-buffer algorithm to the picture composed of objects A and B as 
follows: 

1 . Perform initialization. The Z buffer is set equal to the depth of the back clipping plane - = l, and the 
frame buffer is initialized to a background color y. 


1 

Frame buffer = 

0 


0 1 


V 

y 

y 

V 


1 

Z buffer = 

0 


0 1 


1 

1 

1 

1 


2. Apply the algorithm to object A. 

(a) The present Z-buffer value at pixel (0, 0) is that of the back clipping plane, i.e., Z bul (0, 0) = 1 . The 
depth vapue of A at pixel (0, 0) is z = 0. Then Z buJ (0, 0) is changed to 0 and pixel (0, 0) has the 
color of A. 


1 

Frame buffer = 

0 


0 1 


y 

y 

a 

y 


1 

Z buffer = 

0 


0 1 


1 

1 

0 

1 


(h) Object A is not seen from pixel (1, 0), so the Z-buffer value is unchanged. 


Frame = 


1 

V 

y 

1 

7, , — 

1 

1 

0 

a 

y 

0 

0 

1 


0 


0 1 


(c) Object A is not seen from pixel (0, 1 ), so the Z-butfer value is unchanged. 


Frame = 


1 

V 

y 

1 

7 . — 

1 

1 

0 

a 

V 

& 

1 

0 

0 

1 


0 1 


0 1 
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(</) The depth value of A at pixel (1 , 1 ) is 0. Since this is less than the present Z-buffer value of 1 , pixel 
(1,1) takes the color of A. 


Frame 


1 

V 

a 

1 

7, . — 

1 

0 

0 

a 

y 

^buf — 

0 

0 

1 


0 1 


0 1 


3. Apply the algorithm to B 
The depth 
unchanged. 


(a) The depth value for B at pixel (0, 0) is j, and Z bll ,(0. 0) = 0. So the color of pixel (0, 0) is 


Frame = 


1 

V 

a 

1 

7. , — 

1 

0 

0 

a 

y 

0 

0 

1 


0 I 


0 1 


( 6 ) The depth value of B at pixel (1,0) is 5 . The present Z-buffer value is 1 . So the Z-buffer value is set 
to i and pixel (1,0) takes the color of B. 


Frame = 


1 

y 

a 

1 

7, r ~ 

1 

0 

0 

a 

b 

^buf — 

0 

0 

1 

2 


0 1 


0 


(c) The depth value of B at pixel (0, I ) is 5 . The present Z-buffer value is 1 , so the color at pixel (0, I ) is 
set to that of B , and the Z buffer is updated. 


Frame = 


1 

b 

a 

1 

7i. r — 

1 

2 

0 

0 

a 

b 

^buf “ 

0 

0 

1 

2 


0 1 


0 1 


(c/) The depth value of B at pixel ( 1 , 1 ) is b . The present Z-buffer value is 0. So the color at pixel (1,1) 
remains unchanged. 


Frame = 


1 

b 

a 

l 

7, n ~ 

1 

2 

0 

0 

a 

b 

^buf — 

0 

0 

1 

2 


0 1 0 1 

The final form of the Z buffer indicates that line A lines in front of B. 


10.9 What is the maximum number of objects that can be presented by using the Z-buffer algorithm? 

SOLUTION 

The total number of objects that can be handled by the Z-buffer algorithm is arbitrary because each object 
is processed one at a time. 


10.10 How does the basic scan-line method determine which surfaces are hidden? 

SOLUTION 

The basic scan-line method looks one at a time at each of the horizontal lines of pixels in the display area. 
For example, at the horizontal pixel line y = a, the graphics data structure (consisting of all scan-converted 
polygons) is searched to find all polygons with any horizontal (y) pixel values equal to a. 
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Next, the algorithm looks at each individual pixel in the a row. At pixel (a, fi), the depth values (r values) 
of each polygon found above are compared to find the polygon having the smallest z value at this pixel. The 
color of pixel (a, fi) is then set to the color of the corresponding polygon at this pixel. 


10.11 Using the four pixel display and the graphics objects A and B from Prob. 10.8, show how the basic 
scan-line method would display these objects. 

SOLUTION 

First we initialize the display to the color v of the back clipping plane (located at z = 1). 


0 1 


Frame buffer = 


1 . Set y = 0. The scan-converted representations of A and B contain pixels on the y = 0 scan line. 

(a) Set x = 0. Comparing the z values of A and B at pixel (0, 0), we find that the smaller z value is 0, 
which belongs to A. Thus A is seen from pixel (0, 0); that is, pixel (0, 0) is set to the color of A. 


V 

y 

a 

y 


0 1 

(/>) Set x = 1. Since A is not seen from pixel (1, 0) while B is seen, the color of pixel (1, 0) is set to that 
of B. 


y 

y 

a 

b 


0 1 


2. Set y = 1. The scan -converted representations of A and B contain pixels on the _v = 1 scan line. 

(a) Set x = 0. Because A is not seen at pixel (0, 1 ) while B is seen, pixel (0, 1 ) is set to the color of B. 


b 

y 

a 

b 


0 1 

( b ) Set x = I . Both A and B arc “seen" from pixel (1, 1). The depth of A at pixel (1, 1) is 0, that of B is 
i. Thus A is visible at pixel (1, 1). 


b\ 

a 

a 

b 


0 1 


This represents the final image displayed. 
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1 0.1 2 Mow does edge coherence help to reduce computational effort? 

SOLUTION 

It is based on the assumption that, if an edge or line intersects a given scan line, it will most likely 
intersect those scan lines next to it. Thus if the pixels that intersect the edge are to be found, instead of 
intersecting each scan line with the edge, it is necessary to locate only one intersection pixel and then find the 
others using the slope m of the edge (see Prob. 10.13). 


10.13 Show how the calculation of the intersection of an edge with a scan One can be made incremental as 
opposed to absolute. 

SOLUTION 

The absolute calculation requires that we find the a: intersection value of the edge (e.g., with equation 
y = nix H- b) with the scan line y = a for each «. 

The incremental solution to the problem is based on the following observations. Suppose that x a is the x 
intersection of the edge with the scan line a. Then the intersection x x+i of the edge with the next scan line 
y = a + 1 can be found as illustrated in Fig. 10-21. From Fig. 10-21, where m is the slope of the edge, 

Av (a -I- 1) - a 

-r- = = m 

Ax *»+1 - x a 


Solving for x„ +1 , we obtain 


1 




Thus the calculation of the next intersection point is incremental; that is, it is found from the previous 
intersection point by adding 1 /m to it. 



1 0.14 How does area coherence reduce computational effort? 

SOLUTION 

Area coherence is based on the assumption that a small enough region of pixels will most likely lie within 
a single polygon. This reduces, as in the subdivision algorithm, computational effort in dealing with all those 
polygons that are potentially visible in a given screen area (region of pixels). 


10.15 How is spatial coherence determined? 

SOLUTION 

Spatial coherence is detennined by examining the extent of an object. The rectangular extent (bounding 
box) of an object is determined by finding the minimum and maximum x,y. and z coordinate values of the 
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points that belong to the object. The extent is the rectangular region bounded by the planes z = z^,,, z = z raa , 

y Train* y Tmax* ' -'"min 1 tUld X -'m:v\ • 

10.16 Draw two polygons such that their extents intersect but the polygons themselves don’t intersect. 

SOLUTION 

Figure 10-22 shows one possible solution. 


B 



Fig. 10-22 


10.17 Apply the scan-line algorithm from Sec. 10.5, under “A Scan-line algorithm,” to the display of 
objects A and B in Prob. 10.8. 

SOLUTION 

As in Prob. 10.8, a point (x, v) scan converts to the pixel [Floor(x), Floor(yj]. Also note that a 
nonhorizontal line can be treated as a special polygon: a scanline enters into and exits from the polygon at 
the same pixel position. 

We first construct the edge list (EL) and the polygon list (PL). From Fig. 10-20, the edge entries for the 
nonhorizontal edges A } A 2 , B } S 4 , and B 2 B$ are 


Edge 

.^rain 

X 

.^max 

1 

m 

Polygon pointer 

A\A 2 

1 

2 

\ 

2 

i 

A 

S,5 4 

0 

0 

2 

0 

B 


0 

2 

2 

0 

B 


Here, y lnin is the edge’s smaller y coordinate and x the corresponding x coordinate; y^x is the larger y 
coordinate, and m is the slope of the edge. Form the PL: 


Polygon 

Equation 

IN/OUT flag 

Color 

A 

X = V 

OUT 

a 

(line) 

z = 0 



B 

^ i 
* ~~ 2 

OUT 

b 


PL = 
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The algorithm proceeds as follows. 

I. Initialization. 

(a) We initialize the screen display to the color y of the back clipping plane (located at z = 1 ). 



(f>) Set the scan line to y = 0. 

II. y-scan loop. Activate edges that satisfy Floor(j- mm ) = 0 and sort on .v 



The number of active polygon s is o ne. Thus B is visible at pixel (0, 0) and all pixels on scan 
line v = 0 between edges 5, B 4 and A t A 2 are set to the color of B. In our case, this is just the pixel 
(0, 0 ). 




The number of active polygons is 2. At pixel (0, 0) the depth value (z value) of polygon A is 0 and of 
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polygon B is Thus polygon A is visible at pixel (0, 0), so the color is set to a. 



This means that we set the color of pixel (1, 0) to b. 



0 1 


(a) Again, we repeat step (a), processing edge B 2 B } . We update the PL: 



Polygon 


IN/OUT flag 

... 

PL = 

A 

— 

OUT 

— 


B 

— 

OUT 

— 


The number of active polygons is 0. 

( b ) Having processed B 2 B 2 , the last active edge, we proceed: 

( 1 ) All the v raav values are equal to 2. These values scan-convert (see Prob. 1 0.8) to 1 . The present 
v scan Line value is 0. So no edges are removed. 

(2) Incrementing x by \/m. 


B,5 4 

0 

A\A 2 

3 

2 

b 2 b 3 

2 


(3) Set y = 1. 

Now we repeat steps It and III. 

II. v-scan loop. With y = 1, no additional edge is activated. The result of sorting active edges remains as 
above. 

III. x-scan loop. 








222 


HIDDEN SURFACES 


[CHAP. 10 


(a) Process edge B|B 4 . We update the PL: 


Polygon 


IN/OUT flag 


A 

— 

OUT 

— 

B 

— 

IN 

— 


The number of active polygons is 1 . Thus B is visible at pixel (0, 1 ) and all pixels between edge 
B, B., and A t A 2 on scan line y = 1 are set to the color of B. 


b 

b 

a 

b 


0 1 


(a) Now we repeat step (a), processing edge A t A 2 . We update the PL: 


Polygon 


IN/OUT flag 


A 

— 

IN 

— 

B 

— 

IN 

— 


There are two active polygons. A depth comparison at pixel (1,1) shows that it is set to the color of 
line AjA 2 . 


b 

a 

a 

b 


0 1 


Since A is a line, we set the IN /OUT flag of A to OUT. And we proceed immediately to the next 
edge at the same pixel location. 

(a) Again we repeat step (a), processing edge B 2 B 3 . We update the PL: 


Polygon 

Equation 

IN/OUT flag 

Color 

A 

X = v 

OUT 

a 


z = 0 



B 

2 = i 

OUT 

b 


The number of active polygons becomes 0. 

(b) Having processed the last active edge B 2 B } , we remove those edges for which y nlax equals the y 
scan value of 1. Since this includes all the edges, i.e., y Ilm = 2, the algorithm stops. 


10.18 What is the underlying concept of the painter’s or the priority algorithm? 
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SOLUTION 

The painter’s algorithm sorts polygons by depth and then paints (scan-converts) each polygon onto the 
screen starting with the most distance polygon. 


10.19 What difficulties are encountered in implementing the painter’s algorithm? 

SOLUTION 

First, there is the question of what the “depth” of a polygon is, especially if the polygon is tilted out of 
the xv plane. Second, if two polygons have the same depth, which one should be painted first? 


10.20 If polygon Q has the same depth value as polygon P, which polygon has priority, that is, which is 
painted first? 

SOLUTION 

We perfbnn tests 0, 1,2, 3, 4, 5 (from Sec. 10.4, under “Testing Whether P obscures Q") in order. If any 
one of the tests is true, we say that polygon P does not obscure polygon Q, and so polygon P is painted first. 
If none of the tests is true, we have an ambiguity that must be resolved. 

We resolve the ambiguity by switching the roles of P and Q and the reapply tests 0, 1,2, 3, 4, and 5. If 
any one of these tests is true, Q does not obscure polygon P and so polygon Q is painted first. 

If again none of the tests is true, polygon Q must be subdivided into two polygons Q , and Q 2 , using the 
plane containing polygon P as the dividing plane [Fig. 10-4(i)]. 


10.21 Apply die painter’s algorithm to display objects A and B hi Prob. 10.8. 

SOLUTION 

We first find the depth values z mi)x for A and B. Since z max is the largest z value from all the polygon's 
vertices, then for A , = 0, and for B, z B = 5 . Then sorting on z max , we see that polygon B has a higher 

depth value than polygon A. 

Next, we assign priorities by applying tests 0 through 5 in order (see Sec. 10.4). In test 0, the z extent of B 
is -mm = 5 . -max = 5 - The z extent of A is z inill = 0, z^ = 0. 

Thus the z extents of A and B do not overlap and z , is smaller than z R . Thus test 0 is true, and so we 

y M*uix "tnin 

scan-convert polygon B first: 


1 

Frame buffer = 

0 



Next, we scan-convert polygon A (i.e., we “paint” over polygon By. 


b 

a 

a 

b 


0 I 


This is the final image displayed in the frame buffer. 


10.22 What are the basic concepts underlying die subdivision algorithm? 

SOLUTION 

First, is that a polygon is seen from within a given area of the display screen if the projection of that 
polygon overlaps the given area. Second, of all polygons that overlap a given screen area, the one that is 
visible in this area is the one in front of all the others. Third, if we cannot decide which polygon is visible (in 
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front of the others) from a given region, we subdivide the region into smaller regions until visbility decisions 
can be made (even if we must subdivide down to the pixel level). 

10.23 Apply the subdivision algorithm to the display of objects A and B from Prob. 10.8. 

SOLUTION 

1 . Initialization. We initialize the area to the whole screen. 

2. Forming the potentially visible polygon list. We create the PVPL, sorted on z mjn , the smallest of the z 
values of the polygon in the area. 


PVPL = 


3. Visibility decision. We apply criteria (a) through (d) in Sec. 10.6, step 3. Since all four tests fail, we pass 
to step 4 and subdivide the area into four subregions. 

After subdivision, the four newly formed regions are, in our example, the individual pixels. We apply the 
algorithm to each pixel in turn. 

Region 1 : pixel (0, 0). 

2. Forming the PVPL. 


PVPL = 


3. Visibility decision. Applying tests (a) thrpough (c), we now apply test (d) since the region is pixel size. 
The z coordinate of A at pixel (0, 0) is 0, and that of B is l. Thus A is visible at pixel (0, 0): 

1 

Frame buffer = 

0 

0 1 

Region 2: pixel (0, 1). 

2. Forming the PVPL. Note that A is disjoint from this region: 


PVPL = 






3. Visibility decision. From test (c), there is only one polygon and it is surrounding, so we color pixel (0, 1 ) 
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that of B: 




a 

b 


0 I 


Region 3: pixel (0, 1). 

2. Forming the PVPL. Since A is disjoint from this region, we have 


Polygon 

“min 

Category 

B 

1 

2 

Surrounding 


3. Visibility decision. From test (c), there is only one polygon B and it is surrounding. So region 3 is colored 
b: 


b 


a 

b 


0 1 


Region 4: pixel (1, 1). 
2. Forming the PVPL. 


PVPL = 


Polygon 

“min 

Category 

A 

0 

Intersecting 

B 

1 

2 

Surrounding 


3. Visibility decision. Having applied tests (a) through (c), we now apply test (d). The z coordinate of A at 
pixel (1, 1) is less than that of B. Thus pixel (1, 1) is set to the color of A: 


b 

a 

a 

b 


0 1 


This is the final image in the frame buffer. 

10.24 How can we use the special structure of a convex polyhedron to identify its hidden faces for a 
general parallel or perspective projection? 

SOLUTION 

Suppose that on each face of the polyhedron there is an outward-pointing normal vector N, attached at a 
point P of the face (Fig. 10-23). For each face of the polyhedron, let the line-of-sight vector L be the vector 
pointing from the face to the viewer. For a parallel projection, this is the direction of projection from the object 
to the projection plane. For a perspective projection, it is the vector PC’ from the normal vector attached at 
point P to the viewpoint at point C (Fig. 10-23). 
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A face is visible if the angle 0 made by the line-of-sight vector L and the nonnal vector N is less than 90 ° 
It is hidden if this angle is larger than or equal to 90°. If 0 < 6 < 90 \ then 0 < cos 0 < 1 : if 90 < 0 < 1 80 ‘ 
then —1 < cos 0 < 0. Since (from App. 2) 


cos 6 ■ 


L N 

IMINI 


the face is visible if 


„ L N , 
0 < < 1 

|L||N| - 


and hidden otherwise. To use this visibility test, vve need to find the outward-pointing nonnal vectors for the 
faces of the polyhedron. To do this, we label the faces of the polyhedron so that the polyhedron is oriented. 
That is, any edge shared by adjacent faces is traversed in opposite directions with respect to the faces (Fig. 
10-24). This guarantees that the nonnal vectors will point outward. To construct the outward nonnal vectors, 
we label each polygon with a counterclockwise labeling (Fig. 10-25). Then a nonnal vector can be found by 
taking the cross product of vectors detennined by two adjacent sides of the polygon and attaching it at one of 
the vertices (Fig. 10-25). 


Supplementary Problems 

10.25 How may the properties of parallel projection be used to simplify hidden-surface calculations for any form of 
projection? 

10.26 What happens when two polygons have the same z value and the Z-buffer algorithm is used? 

10.27 How would the Z-buffer algorithm be altered to allow figures to be superimposed on a surface (see Prob. 
10.26)? 

10.28 Assuming that one allows 2 24 depth value levels to be used, how much memory would a 1024 x 768 pixel 
display require to store the Z buffer? 

10.29 How can the amount of computation required by the scan-line method be reduced? 

10.30 How does scan-line coherence help to reduce computation? 

10.31 What is the extent of the polygon whose vertices are A( 0, 0, 1), B( 2. 0, 1), and C(l. 2, 2) (see Fig. 10-26)? 

10.32 Why are only nonhorizontal lines stored in the edge list of the scan-line algorithm? 

10.33 How is the depth of a polygon determined by the painters algorithm? 

10.34 How does the subdivision algorithm exploit are coherence? 

10.35 How can hidden-surface algorithms be used to eliminate hidden lines as applied to polygonal mesh models 
(Chap. 9)? 
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Color and Shading 

Models 


Color is a fundamental attribute of our viewing experience. The perception of color arises from light 
energy entering our visual system and triggering a chain of not-yet-fully-understood neurological and 
psychological responses. This complex process is relevant to computer graphics because a realistic image 
is one that seems indistinguishable from a tme record of the light energy coming from the real scene. It has 
been a constant challenge to find effective and efficient computational models for constructing such 
images. 

hi Sect. 11.1, we first outline the basic relationship between fight (the physical stimulus) and the 
perception of color. This leads to an international standard for the specification and measurement of colors, 
and a better understanding of the widely used RGB color model. In Sect. 1 1.2, we use a well-known 
formula to mimic the effect of objects being fit by light sources (A more elaborate illumination model is 
discussed in Chap. 1 2). In Sects. 1 1 .3 and 1 1 .4, we describe several useful techniques for shading polygon- 
mesh objects and for depicting delicate surface details (surface texture). 


11.1 LIGHT AND COLOR 

Light, or visible light, is electromagnetic energy in the 400 to 700 nm, i.e., nanometer ( 1 0 9 meter), 
wavelength (2) range of the spectrum ( see Fig. 1 1 -1 ). A typical light has its energy distributed across the 
visible band and the proportions are described by a spectral energy distribution function P{A) (see Fig. 
1 1-2). To model a light or reproduce a given light with physical precision one would need to duplicate 
the exact energy distribution, which is commonly referred to as spectral reproduction. 

On the other hand, it has been shown through carefully designed psychophysical experiments that 
discrepancy in spectral distribution does not necessarily lead to difference in perception. Less stringent 
approaches exist to reproduce light to the extent that the reproduction causes the same color sensation to an 
average human observer as the original. These results make it possible to specify or describe light in ways 
that are more perceptually oriented and are easier to handle. 
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Fig. 11-1 


Energy 

density 



400 


700 


Fig. 1 1-2 


Basic Characteristics of Light 

Light can be characterized in three perceptual terms. The first one is brightness, which corresponds to 
its physical property called luminance. Luminance measures the total energy in the light. It is proportional 
to the area bounded by P(/.) and the /. axis in the 400 to 700 nm range. The area can be calculated by 


P(X)Ak 

. l 

The higher the luminance, the brighter the light to the observer. 

The second perceptual tenn is hue, which distinguishes a white light from a red light or a green light. 
For a light with an idealized spectral distribution as shown in Fig. 11-3, hue corresponds to another 
physical properly called the dominant wavelength of the distribution. 
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The third perceptual term is saturation, which describes the degree of vividness. For example, two red 
lights may differ in luminance/brightness, and they may differ in degree of vividness (e.g., pure/saturated 
red vs. pale/unsaturated Ted). Saturation corresponds to the physical property called excitation purity, 
which is defined to be the percentage of luminance that is allocated to the dominant or pure color 
component (see Fig. 11-3). In other words, we have 


saturation 


pure color 

pure color + white color 


Although this simple scheme has its weaknesses (e.g., not all lights have an identifiable dominant 
wavelength), it bridges the physical aspects of light and the perceptual classification of color in a 
straightforward fashion. 


The Trichromatic Generalization Theory 

Let S be a given light or color stimulus. The effect of S (the color sensation of a human subject 
observing S) can be matched by combining light from three primary sources in appropriate proportions: 

S — r ■ red + g • green + b ■ blue 

In other words, the given light and the proportional mix of the three primaries look the same to the 
observer. 

Note that the theory stands with any triple of primaries. Three light sources form a triple of primaries 
as long as none of the three can be expressed as/matched by a linear combination of the other two. We use 
red, green, and blue in the fonnula mainly because they are the standard choice in color-matching 
experiments (with red = 700 run, green = 546.1 nm, and blue = 435.8 nm). Moreover, these three colors 
roughly coincide with the wavelength values that cause peak response from the three types of color- 
sensitive receptor cells in the retina, a membrane that lines the back of the eye’s wall. These receptor cells 
are called /(, <5, and p cones, respectively. They are most sensitive to light in the wavelength range of 
[i : 440 445 nm, (5 : 535-545 nm, and /> : 575-580 nm. Another kind of receptor cells called rods are color- 
blind but are very sensitive to low intensity fight. 

A critical aspect of this tri-stimulus approach is that, in order to match all visible colors, the weight 
values sometimes have to be negative. For example, when red, green, and blue are used as primaries the 
value of b, the weight of the blue component, may be negative. A negative b value means that the effect of 
the given stimulus S cannot be matched normally through the additive process. But if S is mixed with some 
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blue light, then the effect of the mix can be matched by a linear combination of red and green (Intuitively, 
we move the tenn b • blue to the other side of the equation). 


CIE XYZ Color Model 

Looking for a good compromise between the simple and effective tri-stimulus method and the fact that 
no naturally occurring primaries can be used to match all visible colors, the International Commission on 
Illumination (Commission Internationale de PEclairage, or CIE) defined three imaginary (non-realizable) 
primaries, vis., X, F, and Z, in 1931 . They were the result of an affine transformation applied to three real 
primaries to ensure that every single-wavelength light or spectral color can be expressed as a linear 
combination of the CIE primaries without any negative weight. 

The relative amounts of X, F, and Z that are needed to describe each spectral color are shown in Fig. 

1 1 -4 in the form of three color-matching functions x(X), y(i), and z{X). In order to match a color light of 
wavelength a 0 , the proper proportion is found by the line 2 — A 0 intersecting the curves representing the 
three functions. In addition, y(z) matches the luminous efficiency of the human eye and corresponds to the 
eye’s response to light of constant luminance. 



To describe an arbitrary light S with spectral distribution P(/), we “add together” the respective 
amounts of the CIE primaries that are necessary to match all the spectral components of S. This is done 
with 

X = k\ P(k)x(X) dA, F = k [ P(A)v(A) d/t, Z = k [ P(X)z(X)Ak 

Jt Jt Jt 

where k is a light-source-dependent constant. The resultant X, Y (which carries luminance information due 
to the way y(z) is defined), and Z are used as weights to express S as follows: 

S = X X+Y Y + Z Z 


CIE Chromaticity Diagram 

Now we define x, y, and 2 by normalizing the above weights against X+Y + Z: 

X Y Z 

X+Y + Z ' ' ~ X + Y + Z ’ Z ~ X+ Y + Z 

Clearly x+y + z = 1, and we have z = 1 — x —y. The two variables x and y represent colors by grouping 
them into sets, each of which has members that differ only in luminance. 
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Using x and v as the horizontal and vertical axes we can plot the well-known CIE Chromaticity 
Diagram (see Fig. 11-5). The curved triangular figure encompasses all perceivable colors by ignoring 
luminance. Only when two colors differ in chromaticity (hue and/or saturation) are they represented by 
two different points in the diagram (i.e., they have different chromaticity coordinates). 



Fig. 11-5 


To convert from chromaticity coordinates (x, y) back to a specific color in the XYZ color space we need 
an additional piece of information, typically Y : 

X=-Y , Y = Y, Z = 1 ~ X ~ y Y 

y y 

We can find all spectral colors along the figure’s upper border. Point C (0.310,0.316) is known as 
niuminant C. It corresponds to a reference white that is obtained from a spectral distribution close to 
daylight. 

The CIE chromaticity diagram serves as a universal reference for the specification and comparison of 
visible colors. The phosphors of a typical color monitor have the following chromaticity coordinates: R 
(0.62,0.34), G (0.29,0.59), and B (0.15,0.06). These coordinates define a triangular region within the 
diagram. The region is referred to as the color gamut of the monitor, which represents (the chromaticity 
range of) all the colors this monitor is able to display. 


Color Gamut Mapping 

Let (x r , y,.), (x g ,y g ), and (x b ,y b ) be the chromaticity coordinates of the phosphors of an RGB color 
monitor. What are the XYZ coordinates of its colors? We first introduce auxiliary variables 
C r = X r + Y r + Z r , C g = X g + Y g + Z„, and C h = X h + Y h + Z b , where (X r , Y r . Z,.), (X g , Y g , Z g ), and 
(X h , Y h , Z h ) denote the respective XYZ coordinates of the red, green, and blue colors the monitor can 
display. We have 

X r — x r C r , Y r — y r C r , Z, — z r C r where z r — 1 — x r — y r 

X g — x g C g , Y g — y g C g , Z g — z g C g where z g — 1 — x g — y g 

X b - x b C b , Y b —y b C b , Z h = z b C b where z b = 1 - x b - y b 
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Thus the XYZ coordinates of a composite RGB color can be expressed in terms of a transformation M: 

( x r C r x g C g x b C b \ / R\ (x r x g x b \(C r 0 0 \ 

y r c r y g C g y b C b G = \ y r y g y b 0 C g 0 

\ z r C r z g C g z b C b ) \ B ) \z r Zg z b )\ 0 0 C b ) 

Tliere are two different ways to determine C n C g , and C b . One is to use a photometer to measure the 
luminance levels Y r , Y g , and Y b of the red, green, and blue colors at their maximum intensity /brightness, 
respectively. Then 

G r = Y r /y r , C g = Y g /y g , C b = Y b /y b 

The other is to find/measure the XYZ coordinates (A',,,, Y w , Z w ) of the monitor’s white color 
(R — G = B — 1 ). Using these we have 

X A (*r x s 

Y w = >V 

/ \ Z r Z g 

We can now solve for C,„ C g , and C b . 

If M x is the transfonnation for monitor 1 and M 2 is the transfonnation for monitor 2, then a color 
(i?j, G[ , B x ) on monitor 1 can be matched by a corresponding color (R 2 , G 2 , B 2 ) on monitor 2: 

/M /M 

G, =M 2 l M { G, 

\b 2 J W 

provided that (R 2 , G 2 , B 2 ) is within the color gamut of monitor 2. 





The NTSC YIQ Color Model 

The NTSC (National Television System Committee) YIQ color model is used for commercial 
television broadcasting in the US. It is defined to be a linear transformation of the RGB color model, 
with the Y component purposely made to be the same as the Y in the CIE XYZ color model. Since Y carries 
luminance information, black-and-white television sets can use it to display color images as gray-scale 
pictures. The mapping between RGB and YIQ is as follows: 

(Y\ / 0.299 0.587 0.114 \ / R\ 

7 = 0.596 -0.275 -0.321 J J G I 

\q) \ 0.212 -0.523 0.311 / \ Bj 

The quantities in the transformation matrix are obtained using the standard NTSC RGB phosphors whose 
chromaticity coordinates are R (0.67,0.33), G (0.21,0.71), and B (0.14,0.08). It is also assumed that the 
white point is at x w = 0.31, v w = 0.316, and Y w = 1.0. 


11.2 THE PHONG MODEL 

This is a widely used and highly effective way to mimic the reflection of light from object surfaces to 
the viewer’s eye. It is considered an empirical approach because, although it is consistent with some basic 
principles of physics, it is largely based on our observation of the phenomenon. It is also referred to as a 
local illumination model because its main focus is on the direct impact of the light coming from the light 
source. On the other hand, a global illumination model attempts to include such secondary effects as light 
going through transparent/translucent material and light bouncing from one object surface to another. 

Now consider a point light source [see Fig. 11 -6(a)], which is an idealized light with all its energy 
coming out from a single point in space (a reasonable approximation to a bulb). Our eye is at the viewpoint 
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Light source 



View point 

V 


A 



<P 


(a) 


(*) 


Fig. 11-6 


looking at point Q on the surface of an object. What should be the color of Qfl In other words, what should 
be the color of the light reflected into our eye from Q (in the direction of vector V)? 

There are two extreme cases of light reflection. The first is called diffuse reflection. In this case light 
energy from the light source (in the direction of -L) gets reflected/bounced off equally in all directions 
(see the small arrows forming a half-circle/hemisphere). We also want the energy level of the reflection to 
be a function of the incident angle 9 (between L and surface normal N). The smaller the angle, the higher 
the reflection (kind of like bouncing a ball off a wall). The mathematical tool we use to achieve these is to 
have the reflection proportional to cos(fl). 

The second case is called specular reflection. It attempts to capture the characteristic of a shiny or 
mirror-like surface. Were the surface in Fig. 1 1 -6(a) a perfect mirror, energy from the light source would be 
reflected in exactly one direction (the direction of vector R). Since a perfect mirror is nonexistent we want 
to distribute reflected energy across a small cone-shaped space centered around R, with the reflection being 
the strongest along the direction of R (i.e., <p = 0) and decreasing quickly as cp increases [see the bell- 
shaped curve in Fig. 1 1-6(6)]. The mathematical means for modeling this effect is cos k (cp), where the 
parameter k provides for a convenient way to vary the degree of shininess (k = 1 for a dull surface and 
k — 100 for a mirror-like surface). For a given scene we can find out the amount of specular reflection in 
the direction of V using the actual angle ip between R and V. 

Furthermore, the complex inter-object reflection needs to be accounted for in some way because many 
surfaces we see are not lit directly by the light source. They are lit by light that is bouncing around the 
environment. For this we introduce a directionless ambient light, which illuminates all surfaces in the scene 
and gets reflected uniformly in all directions by each surface. 

Thus in the Phong model, object surfaces are thought to produce a combination of ambient-light 
reflection and light-source-dependent diffuse/specular reflection. Mathematically we can state the total 
reflected energy intensity I as 


7 = I s k. d + 7 p (A d cos(0) + A s cos *(</>)) 

where 7 a is the intensity of the ambient light; 7 p is the intensity of the point light; and 0 < A a , A d , k s < 1 .0 
are reflection coefficients that represent the surface’s ability to reflect ambient light, to produce diffuse 
reflection, and to produce specular reflection, respectively (e.g., A a = 0.2 means 20% reflection of 7 a ). 

If L, N, R, and V are all unit vectors, then L • N = cos(ff) and R V = cos(rp). We can write the 
formula as 


7 — 7 a Aj + 7 p (A d L • N + A S (R • V)*) 
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Note that the term L • N is used in computing R (see Prob. 1 1.10). When there are two or more light 
sources in the scene, their effects are cumulative: 

1 = kh + t y* d L, ■ N + k s (R, ■ V) k ) 

i— 1 

These formulas are typically used along with the RGB color model. Thus light intensity is in the form 
of an RGB color vector, e.g., 1 — (/,., /,,, I h ). Reflection coefficients are also three-dimensional vectors. For 
example, k A = (0.7, 0.7, 0.3) defines a surface that looks yellowish when illuminated with white light. The 
ambient reflection coefficient k A can simply be the same as k A . Tire three components of A' s are often made 
equal since the color of the reflection of a light source is typically the same as the color of the light source 
itself. Wren the light is white, the formula for a single point source becomes 

4 = l a k,r + /p(**L • N + A S (R • V)*) 

■ I g = IX, + /p( • N + * s < R ■ V)*) 

4 = ■ N + A S (R • V)*) 

Figure 1 1-7 shows a gray scale image with 16 views of a sphere shaded using the Phong formula. The 
four rows from top to bottom are calculated using k A /k s = 0.5/0. 1, 0.5/0.3, 0.5/0.5, and 0. 5/0.7, 
respectively. The four columns from left to right represent k = 3, 5, 20, and 100, respectively. 


###4 

9##4 

• • • 

• + m 0 


Fig. 11-7 


11.3 INTERPOLATIVE SHADING METHODS 

Computing surface color using an illumination model such as the Phong formula at every point of 
interest can be very expensive. It is even unnecessary when the image is used to preview' the scene. To 
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circumvent this problem, we apply the formula at full scale only at selected surface points. We then rely on 
such techniques as color interpolation and surface-nonnal interpolation to shade other surface points. 


Constant Shading 

The least time-consuming approach is not to perform calculation for additional surface points at all. 
The color values of those selected surface points are used to shade the entire surface. 

For example, a cylindrical object may be modeled by a polygon mesh as shown in Fig. 1 l-8(a). We can 
evaluate the Phong fonnula using the geometric center of each polygon and set the resultant color to every 
pixel that belongs to the corresponding polygon [see Fig. 11-8(6)]. 





(a) (b) 

Fig. 11-8 

Constant shading can produce good results for dull polyhedrons lit by light sources that are relatively 
far away (hence the diffuse component varies little within each polygonal face). The primary weakness of 
this method is that false contours appear between adjacent polygons that approximate a curved surface [see 
the cylindrical object in Fig. 1 1 -8(6)]. In addition, the specular reflection of the light source (often referred 
to as the specular highlight ) tends to get lost. On the other hand, if a selected surface point happens to be at 
the location of the strongest specular reflection of the light source, then the color of the corresponding 
polygon will be, for the most part, significantly distorted. 


Gouraud Shading 

In this approach we evaluate the illumination formula at the vertices of the polygon mesh. We then 
interpolate the resultant color values to get a gradual shading within each polygon. 

For example, we use bilinear interpolation to find the color of point P inside a triangle whose vertices 
are P, , P 2 , and P } (see Fig. 1 1-9). The scan line containing P intersects the two edges at points P’ and P" . 
We interpolate the color of P l and the color of P 2 to get the color of P’ . We then interpolate the color of P 2 
and the color of P 3 to get the color of P". Finally, we interpolate the color of P' and the color of P" to get 
the color of P (see Prob. 11.15). 

Wien polygons are used to approximate a curved surface, there are two ways to determine the normal 
vectors at the vertices in order to get a smooth-looking transition between adjacent polygons. The first is to 
use the underlying curved surface to find the true surface normal at each vertex location. For example, the 
vertices of the polygon mesh in Fig. 11 -8(a) are on the surface of an underlying cylinder. The normal 
vector at a point on the cylinder is perpendicular to the axis of the cylinder and points from the axis to the 
point [see Fig. 1 1-1 0(a)]. 

On the other hand, it may sometimes be difficult to find normal vectors from the underlying surface, or 
the polygon mesh may not come from any underlying surface. A second approach to deciding normal 
vectors at the vertices is to average the normal vectors of adjacent polygons. For example, to determine 
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normal vector N at vertex P in Fig. 1 1-10(6), we use the average of the normal vectors of the polygons that 
meet at P: 

N = N, + N 2 + N, + N 4 

where N is then normalized by dividing it by |N|. 

Common normal vectors at the vertices of a polygon mesh that approximates a curved surface result in 
color values that are shared by adjacent polygons, eliminating abrupt changes in shading characteristics 
across polygon edges. Even when the polygon mesh is relatively coarse, Gouraud shading is very effective 
in suppressing false contours. However, a much finer mesh is needed in order to show reasonably good 
specular reflection. 


Phong Shading 

Instead of color values, we may also interpolate normal vectors. For example, to find the normal vector 
N at point P in a triangle (see Fig. 1 1-1 1) , we first interpolate N , and N? to get N', then interpolate N 2 and 
N 3 to get N", finally interpolate N' and N" to get N. 



Fig. 11-11 




CHAP. 11] 


COLOR AND SHADING MODELS 


239 


This technique is relatively time-consuming since the illumination model is evaluated at every point of 
interest using the interpolated nonnal vectors. However, it is very effective in dealing with specular 
highlights. 


11.4 TEXTURE 

While gradual shading based on illumination is an important step towards achieving photo-realism, 
most real-life objects have regular or irregular surface features (e.g., wood grain). These surface details are 
collectively referred to as surface texture. In this section we present three approaches to adding surface 
texture: projected texture, texture mapping , and solid texture. 


Projected Texture 

Imagine putting a plain object between a slide projector and the projection screen. The surfaces of the 
object that face the projector will be covered by the projected image. If the image depicts wood grain then 
the affected surfaces will have the intricate wood texture superimposed onto its original shading. 

In computer graphics, projecting texture onto an object is effectively the inverse of projecting an object 
onto the view plane (compare Fig. 7-1 and Fig. 11-12) . We now refer to the view plane as the reference 
plane, which contains a two-dimensional texture definition called the texture map. The projection line that 
associates point P with its image P' allows the shading of P to be determined or influenced by information 
in the texture map at location P'. 



Fig. 11-12 


The texture map is often a synthesized or scanned image, in which case it contains color attributes. 
There are several ways for the color attributes of P' to be used to shade point P. First, we may simply assign 
these attributes to P, replacing whatever color attributes P currently has. The net effect of this approach is 
to paint the color of the texture onto the object. Second, we may interpolate between the color C of P' and 
the original color C of P using ( 1 - k)C + kC where 0 < k < 1 .0 to come up with a new color for P. The 
result of this calculation makes the texture appear blended into the original shading. Third, we may use a 
logical operation involving C and C to compute a new color for P. For example, when the AND operation 
is used, a texture area with magenta shades will appear unaltered if the original color is white, but it will 
take on red shades if the original is yellow. 

Projected texture is an effective tool when target surfaces are relatively flat and facing the reference 
plane. On the other hand, “hidden surfaces" are not affected, and features of the texture are distorted by the 
curvature of the surface. 
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Texture Mapping 

Now imagine the texture map being a thin and flexible sheet made of elastic material. When we wrap it 
around the surface of an object we can stretch or shrink it so as to follow the shape of the object. 

This approach is referred to as texture mapping. We describe the texture map in a two-dimensional 
space called the texture space with coordinates (u, w), and w'e represent the surface of an object in 
parametric form using (6, <p). The mapping from the texture space to the object space is then defined by 

6 =/(«, w) cp — g(u y w) 

and the inverse mapping by 

u = r(6, ip) w = s(6 , cp) 

Although not necessary, these mapping functions are generally assumed to be linear (so the texture map 
stretches and shrinks proportionally). Thus we can write 

0 — Au + B tp — Cw + D 

where the constants A, B, C, and D are obtained from the relationship between known points in the two 
spaces (e.g., comers of the texture map and the corresponding surface points). 

As mentioned early, the texture map often contains color attributes that can be used in different ways to 
alter the original shading. In addition, its role may be extended to provide other kind of information that 
affects the outcome of the shading process. For example, a technique called bump mapping uses the 
quantities in the texture map to displace or perturb the normal vectors of the surface to produce an 
appearance of roughness (see Fig. 11-13 for a one-dimensional illustration of the effect). 



Smooth-looking A bump map to Rough-looking 

surface perturb the surface 

magnitude and 
direction of the 
surface normals 

Fig. 11-13 

In comparison with projected texture, the method of texture mapping can better adapt to a curved 
surface. However, complex shapes deter the derivation of mapping functions. Furthermore, both methods 
are particularly weak w'hen features of the texture on one surface should “match” those on others. Consider 
a wood cube, for instance: the wood grain on one side should exhibit continuity and consistency with the 
wood grain on all adjacent sides. As we turn the wood cube into a sculpture, the complexity of this texture- 
matching problem simply multiplies. 


Solid Texture 

The weaknesses of projected texture and texture mapping largely stem from the inherent mismatch 
between a two-dimensional texture map and a three-dimensional surface. In this approach we define texture 
using a three-dimensional texture space. The texture definition, referred to as solid texture, is a three- 
dimensional representation of the internal structure of some nonhomogeneous material such as wood, 
marble, etc. 

When rendering an object made of a specific material, we use transformations to place the object into 
the coordinate system where the texture for the material is defined. In other words, we map a surface point 
P(x, y, z) to its coordinates (ti, v, u j in the texture space. Information associated with this location is then 
used to control the shading of/! This makes the object look as if it were carved out of a solid piece of the 
chosen material. 
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(a) 


( 6 ) 

Fig. 11-14 




Solid texture is often defined procedurally. The following is an example that shows the development of 
a model for wood grain. We begin by representing the most prominent feature of the material with a series 
of concentric cylinders in the texture space [see Fig. 1 1-1 4(a)]) . When an object such as the parallelepiped 
in Fig. 1 1-1 4(b) is placed into the texture field, the object intersects the cylinders, leaving intersection 
contours on its surfaces. The contours we get by aligning the object coordinate axes with the texture 
coordinate axes are shown in Fig. 1 1- 14(c). We now rotate the front end of the parallelepiped slightly to the 
left about the v axis, and slightly up about the u axis. The result is our first approximation to wood grain 
(see Fig. 1 1-15). 



We can see that the contours on different sides are consistent with the underlying three-dimensional 
structure, but they are just too smooth and too perfect to look real. In order to incorporate some asymmetry 
and irregularity we need to bring two updates to the model. The first is to add ripples or sinusoidal 
perturbations to the cylinders so the intersection contours will have some minor and natural-looking 
oscillations. The second is to turn or twist the ripples slowly along the w axis to give the contours a skewed 
appearance. An implementation of the improved cylinder model is shown in the following pseudo-code 
procedure: 

wood(irain(/JL kd) 

{ 

float radius, angle; 

radius = sqrt(pf[w] 2 I pf[v] 2 ); 

angle = (pt[u] = 0) ? n/2 : arctan(/u“[v]/pf[n]); 

radius = radius + w*sin(/*angle + r*/?/[w]); 
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if ((int)radius % p <d) 

set the components of kd to produce a dark shade; 
else 

set the components of kd to produce a light shade; 

} 

where array parameter pi represents the coordinates of surface point P in the texture space, and array 
parameter kd is used to return the proper reflection coefficients for shading point P. 

The first instruction in woodGrain is to determine the radius of the cylinder that intersects P. The next 
instruction sets the base parameter value for the sine function that delivers the ripple effects. The 
magnitude of the perturbation is controlled by m, the frequency or number of ripples in a full circle is 
controlled by/ and the rate at which the ripples rotate along the w axis is controlled by r. Finally, we treat 
wood grain as alternating bands of dark and light cylinders of period p. Within a period the two bands are 
separated by threshold d. Although only two shades are shown in the procedure, the value of kd should be 
made to vary based on the relative position of P within each band to produce better results (see Fig. 11-16 
for a ray-traced sphere). 



Fig. 11-16 


Solved Problems 


11.1 Assuming that the medium of interest is air (or a vacuum) express the visible band of the spectrum 
in terms of a frequency range. 

SOLUTION 

Since frequency = speed of light/wavelength, and speed of lights 3.0 x 10 8 m/s, we obtain the 
frequency range (3.0 x 10 8 )/(700 x 10 -9 ) to (3.0 x 10 8 )/(400 x 10 _9 )Hz or 4.3 x 10 14 to 7.5 x 10 14 IIz. 


11.2 Name the three perceptual terms for describing color and the corresponding physical properties of 
light. 
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SOLUTION 

Brightness/luminance, hue/dominant wavelength, saturation/excitation purity. 


1 1.3 Derive a simple formula to calculate the area bounded by the distribution function P(X) in Fig. 1 1-3. 

SOLUTION 

Let IF and // be the width and height of the rectangular area that corresponds to white, and D and P be the 
width and height of the rectangular area that corresponds to pure color. The area bounded by P(A) is 
W x H + D x P. 


11.4 Name the two kinds of receptor cells in the retina and describe their basic function. 

SOLUTION 

They are cones for seeing colors, and rods for perceiving low intensity light. 


11.5 Wiry did we say that red, green, and blue only roughly coincide with the wavelength values that 
cause peak response from the three types of color-sensitive cones? 

SOLUTION 

The three colors have the following typical wavelength: red = 700nm, green = 546.1 nm, and 
blue = 435.8 nm. Although blue and green closely match the sensitive range of two types of cones, 
p : 440 — 445 nm and <5 : 535 — 545nm, the sensitive range of the third type, p : 575 — 580 run, is actually 
yellow, not red. 


1 1.6 Do we have to use Y in order to convert from chromaticity coordinates ft, y) back to a specific color 
in the XYZ color space? 

SOLUTION 

No. If we know X, we can use 

V 1 — x — V 

X=X. Y=--X. Z = -X 

X X 


11.7 Presume that a monitor produces what is called the standard white D 65 with x w = 0.313, 
y w = 0.329, and Y w = 1.0 when R = G = B — 1 , and the chromaticity coordinates of its phosphors 
are as given in Sec. 11.1. Find the color transformation matrix M for the monitor. 

SOLUTION 


Since Y w = 1 .0, we have 

W w = x w /v w = 0.95 1 and Z K = ( 1 - x w - y w )/y w =1.088 

Use these and the chromaticity coordinates of the phosphors 

( 0.951 \ / 0.62 0.29 0.15\/C,.\ 

1.0 = I 0.34 0.59 0.06 Q 

1.088/ \0.04 0.12 0.79/ \cj 

Solve and get C r = 0.705. C g = 1.170, and C b = 1.164. Now 


X f t? 

X g^g 

x b C b \ 

/ 0.437 

0.339 

0.175 

y r C r 

y s c g 

) = 

= I 0.240 

0.690 

0.070 

-rC r 

r 

-g g 

-nQ / 

\ 0.028 

0.140 

0.920 
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1 1.8 Verify the fact that the Y in the CIE XYZ color model is the same as the Y in the NTSC YIQ color 
model. 

SOLUTION 

Find the transfonnation to XYZ for the standard NTSC RGB display (see Sec. 11.1). Since 
x w = 0.31, v w = 0.316, and fj, = 1.0, we have 

X w = x w /y v = 0.98 1 and Z w = ( 1 - x w - y w )/y w = 1 . 1 84 

Use these and the chromaticity coordinates of the standard NTSC phosphors 

/ 0.981 \ / 0.67 0.21 0 . 1 4 \ / C r \ 

1.0 = 0.33 0.71 0.08 I C g 

V 1.184/ \0.0 0.08 0.78/ \C b ) 

Solve and get C,. = 0.906, C g = 0.826, and C h = 1 .432. Now 


/ x r C r 

x g*~g 

•vG \ 

/ 0.607 

0.173 

0.200 \ 

M = v r C r 

YgCg 

>’*G 

= 1 0.299 

0.587 

0.114 

U Cr 

2 C 

z b c t ) 

\0.0 

0.066 

1.117 / 


Since the middle row of M is the same as the top row of the matrix for mapping from RGB to YIQ , we can see 
that the Y in CIE XYZ is the same as the Y in NTSC YIQ. 


1 1.9 What is the difference between a local illumination model and a global illumination model? 

SOLUTION 

A local illumination model focuses on the direct impact of the light coming from the light source. On the 
other hand, a global illumination model attempts to include such secondary effects as light going through 
transparent/translucent material and light bouncing from one object surface to another. 


1 1 .1 0 Refer to Fig. 1 1 -6 in Sec. 11.2. Find a formula to compute R. the reflection of vector L with respect 
to normal vector N. 

SOLUTION 

Introducing auxiliary vectors e and m (see Fig. 1 1-17), we can write 

R = m — e 



Since L = e + m, we have e = L - m and 

R = in — (L — m) = 2m - L 

Note that m is simply the perpendicular projection of L onto N (see Prob. A2.16) and N is a unit vector, hence 


m = (L NjN 
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Finally we have 


R = 2(L ■ N)N - L 


11.11 The term R • V in the Phong formula is sometimes replaced by N • H, where H is a unit vector that 
bisects the angle between L and V Show that R • V ^ N H. 

SOLUTION 

Let L, N, and V be coplanar. Since L, N, and R must be coplanar from the laws of optics, and L, V, and 
H, the bisector of the angle between L and Y must also be coplanar, all the vectors are coplanar (see Fig. 
11-18). From the angles shown on the left half of the drawing we have 

P = 6 + a 



Fig. 11-18 

From the angles shown on the right half of the drawing we have 

<x + P = 0 + <p 

Combining the two expressions we get 

<p = 2a 

Unless (p = a = 0, we have <p ^ a and cos(tp) # cos(a). Since R V = cos(<p) and N H = cos(a), we have 

R V # N H 


1 1.12 The color of an object is largely determined by its diffuse reflection coefficient. If k d = (0.8, 0.4, 0) 
and the light is blue, what is the color of the object? 

SOLUTION 

Black, since the object does not reflect blue light and there is no red and green light for it to reflect. 


11.13 Refer to Prob. 11.12. What if we use magenta light? 

SOLUTION 

Red, since the object only reflects the red component of the magenta light. 


11.14 When a light source is relatively far away from a polyhedron, the diffuse reflection determined by 
the Phong formula varies little within each polygonal face. Why? 

SOLUTION 

All points on a polygonal face share the same normal vector N. When the light is relatively far away, 
vector L (see Fig. 1 1-6) varies little from one surface point to another. (If the light is very distant, say, the sun. 
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then L becomes a constant vector.) Hence L • N, the term that determines the diffuse component in the Phong 
formula varies little within each polygonal face. 


1 1.15 Refer to Fig. 1 1-9, and show exactly how to compute the color of P using bilinear interpolation. 

SOLUTION 

Let the coordinates of P, P, , P 2 , P,, P', and P" be (x,y), <x, ,>’]). (x 2 , v 2 ) - (x',y'), and 

respectively. Also let the color/intensity values at those points be /, 7 lt I 2 , 1 } , and /", and respectively. We 
have 


and 




M ' 


}’2 


y2 -y i 


V r V 

■+h: 


j i 


li r = v Lz» +;) az£. 

' V 2 — vi y% — ,V3 y 2 — y 3 


1 = 1 ' 


x" -x J 


+ 1 " 


x-x 
X" - A-' 


For RGB colors, these fonnulas are applied to the values of each color component. 


11.16 Suppose point Pj with intensity /, is on scan line y i and point P 2 with intensity I 2 is on scan line y 2 . 
Find an incremental formula to compute intensity values V for all the scan lines between P, and P 2 
using linear interpolation in the y direction. 

SOLUTION 

Let P x be the starting point, then the intensity change from one scan line to the next is 

A I = (I 2 — l\)/(y2 — V|) 


Hence 


l{ = /j and J' = //_! -F A I where i = 2, . . . ,y 2 — Vj 


11.17 Refer to Prob. 11.16. If point P l on line 5 has an RGB color ( 1, 0.5, 0) and point P 2 on line 15 has 
an RGB color (0.2, 0.5, 0.6), what is the color for line 8? 

SOLUTION 

Since 

A R = (0.2 - 1 )/( 1 5 - 5) == -0.08 
AG = (0.5 — 0.5)/(15 — 5) = 0 
A B = (0.6 - 0)/( 1 5 - 5) = 0.06 

we have ( 1 + 3 x (—0.08), 0.5 + 3 x 0,0+3 x 0.06) = (0.76, 0.5, 0. 1 8) as the color for line 8. 

11.18 Refer to Fig. I l-10(a). The vertices are shared by the rectangles that approximate the cylinder and 
the two polygons that represent the top and bottom of the object. Does each vertex have a unique 
normal vector? 

SOLUTION 

No. When a vertex is used to describe the top or bottom face, its normal vector is the normal of the 
corresponding face. When it is used to define the rectangle mesh, its normal vector is perpendicular to the 
cylinder as shown in the figure. 
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11.19 Refer to Fig. 1 1-1 1 and see Prob. 11.15. If the two normal vectors at P’ and P" are 

, v/2 \/2 „ sfl s/2 

N' = —■ VI+ — J N = - —— I + — — K 

2 2 2 2 

and point P is half way between P' and P", what is N? 

SOLUTION 

Since P is in the middle of P' and P", the last formula in Prob. 11.15 for linear interpolation in the x 
direction becomes 

/' I" 

I= 2 + l 

Use it to interpolate each component of N' and N"; 

s/2 s/2\ . v/2, , s/l„ , >/2. . v/2. 


(-?-¥) 


4 4 

The magnitude of this vector is 


i+VJ+^-k = — ^-i+^-j+^-k 

4 4 2 4 4 


/ s/2 Y / s/2\ (s/i\ [2 2 2 s/3 

= V5 + T6 + T6 = T- 


Use it to normalize the interpolated vector and get 


N = -^I + ^J + ^K 
3 o o 


1 1.20 Show that when averaging or interpolating normal vectors we will get incorrect result if the vectors 
are not unit vectors (or vectors of equal magnitude). 

SOLUTION 


Consider N, = I and N 2 = J: the average of the two is I + J, which bisects the 90° angle between I and 
J. We can normalize it to get 


s/2 s/2 


Now if we set N 2 = 2J, which lias the same direction as J. the average of N t and N 2 becomes I 4- 2J. 
Clearly, the direction of this vector differs from that of N. 

As for interpolation we can see from Prob. 11.19 that to compute a vector for the point halfway 
between two given points we apply 


/ - 



to each vector component. For N, = I and N 2 = J, this produces 


which is, after normalization, the same as N above. But, if we use N 2 = 2J, the result of interpolation 
becomes 


^I + J 


which has the direction of I + 2J, not the direction of N. 
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1 1.21 Consider the texture-blending formula (1 - k)C -f kC where 0 < k < 1 .0, C is the original color of 
the object, and C' is the color of the texture map. Describe in simple words the results that 
correspond to the two extreme cases: k — 0 and k — 1 . 

SOLUTION 

The case k = 0 means no texture at all, whereas the case k = 1 means painting the texture over the 
original shading of the object. 

1 1 .22 When we use the logical-operation AND to combine the original color of the ob ject and the color of 
the texture map, a texture area with magenta shades will appear unaltered if the original color is 
white, but it will take on red shades if the original is yellow. Why? 

SOLUTION 

Various shades of magenta can be described by RGB color vectors in the form of (m, 0, in ) . The result of 
white (1 , 1, 1) AND (m, 0, m) is (m. 0, m ) , whereas the result of yellow (1, 1,0) AND (m. 0, m) is (m. 0, 0), 
which represents shades of red. 

11.23 See Fig. 11-19, and find the linear functions that map the normalized image onto a square area of 
50 x 50 in the middle of the front face of the cubic object. 



Fig. 11-19 


SOLUTION 

First find a parametric representation tor the target area 

[ X = 0 25 < 0 < 75 

| y = (p 25 < <p < 75 

l 2 = 100 

Note the relationship between the corner points 

u = 0, w = 0 — 6 = 25, <p = 25 
n = 1, w = 0 — 0=75, <p = 25 
u = 0, w = \ —*■ 6 = 25, ip = 75 
a=l, w = 1—0 = 75, ip = 75 

Substitute these into 0 = Au + B and ip = Cw + D. we get 

A = 50, B = 25, C = 50, and D = 25 

Hence the mapping functions are 

0 = 50 u 4- 25 and <p = 50w + 25 

The inverse mapping functions are 

0-25 , <p - 25 

„ = _ „ d »•=— 
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11.24 See Fig. 11-20. Find the linear functions that map the normalized grid pattern onto the bottom 
portion {n/4 < <p <n/2) of the spherical surface in the first octant. 




SOLUTION 


The parametric representation for the target area is 

x = R sin(#) sin(</>) 0 5 9 < n/2 

v = Rcos(tp) tt/4 < (p < n/2 

z = R cos(0) sin ip) 

Note the relationship between the comer points 

u = 0, w = 0 — * 6 = 0, tp = n/2 
u = 1, w = 0 — * 9 — n/2, cp = n/2 
it — 0, w = 1 — »■ 0 = 0, (p — n/4 
u = 1 , w = 1 — ► 6 = n/2, <p = n/4 

Substitute these into 9 =Au + B and <p = Cw+D, we get 


A = n/2, B = 0. 


C = —n/4, and D = n/2 


Hence the mapping functions are 



The inverse mapping timctions are 

9 



and 


n n 

,p = ---w 


and 


w = 


n/2 — <p 
n/4 


Supplementary Problems 


11.25 Given the distribution function P(/ 1) in Fig. 1 1-3, derive a formula to calculate saturation. 


11.26 Why everything looks gray or black in a dark room where we can barely see? 

1 1.27 Can we use Z to convert from chromaticity coordinates (x, y) back to a specific color in the XYZ color space? 


11.28 What ’s the difference between the Y in CMY and the Y in YIQ'l 
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11.29 Consider a sphere of radius R, centered at (x c , y c , z,.). What is the normal vector N at point (x, y , :) on the 
sphere? 


11.30 Find the linear functions that map a 1.0 x 2.0 texture map onto a cylindrical surface in the first octant (see Fig. 
11-21) . The short sides of the texture map should align with the straight edges of the surface. 


♦w 


vv 

U 

0 1.0 



Fig. 11-21 




Ray Tracing 


Ray tracing is a global illumination model that accounts for the transport of light energy beyond the 
direct/local contribution from the light sources. Its operation is largely based on the principles of 
geometric optics. By tracing the path of light rays it is able to integrate the tasks of shading, shadow 
generation, hidden surface removal, projection, and scan conversion into one single computational process. 

In this chapter we first introduce the fundamental concepts that underlie this elegant approach (Sect. 
12. 1 ). We then discuss the basic ray-tracing algorithm (Sect. 12.2), the parametric vector representation of 
a ray (Sect. 12.3), and the mathematics of ray-surface intersection (Sect. 12.4). We also present techniques 
for improving execution efficiency (Sect. 12.5), anti-aliasing (Sect. 12.6), and achieving some desired 
visual effects (Sect. 12.7). 


12.1 THE PINHOLE CAMERA 

In theory one can take perfect pictures using a pinhole camera — a closed box with a tiny hole in the 
center of the front panel (see Fig. 12-1 ). The hole is so small that only one light ray passing through it can 
strike a particular point on the negative, which is mounted on the inside of the back panel. As light rays 
from across the scene expose the negative by hitting their respective target precisely, a sharp image 
depicting the scene emerges. 



Image of object 


Fig. 1 2-1 A pinhole camera. 

Now we place a screen between the pinhole and the object (see Fig. 12-2), causing the light rays to 
intersect the screen. If we can record each light ray as it intersects the screen, we will have a perfect picture 
of the object on the screen. Or, to put this in tenns of image synthesis, if the screen resolution is arbitrarily 
high and we can set the pixel at each intersection point to match the color of the corresponding light ray, 
then the image will be indistinguishable from the real scene when viewed from the position of the pinhole. 
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12.2 A RECURSIVE RAY-TRACER 

In order to construct an image based on the pinhole-camera model we need to determine the light rays 
that are responsible for the pixels. We do so by following their path in the opposite direction, i.e., from the 
viewpo int through the center of each pixel into the scene ( see Fig. 12-3). These are called the primary rays. 
If a primary ray does not intersect any object in the scene, then the corresponding pixel is simply set to 
some background color. 



On the other hand, if a primary ray intersects an object, then the color of the corresponding pixel is 
determined hy the surface shading of the object at the intersection point. Several secondary rays are used to 
compute the three components of this surface shading. The first component is called the local contribution, 
which refers to the direct contribution from the light source(s). We send a shadow ray or illumination ray 
from the surface point to a light source. If the ray is blocked before reaching the light source, the surface 
point is in shadow (relative to this light source). The second component is called the reflected contribution, 
which refers to the reflection of light energy coming from another object surface (inter-object reflection). 
This is determined by a ( specularly ) reflected ray, a ray that represents the reflection of the primary ray with 
respect to normal vector N. The third component is called the transmitted contribution, which refers to the 
transmission of light energy coming from behind the surface (transparent object). This is detennined by a 
(specularly) transmitted ray, a ray that represents the refraction of the primary ray with respect to N. 

Note that, were the object surface a perfect mirror, the specularly reflected ray would represent the sole 
direction of light reflected towards the viewpoint. Also, if the object were made of perfectly homogeneous 
material, the specularly transmitted ray would represent the sole direction of light refracted towards the 
viewpoint. 

Snell’s law determines the relationship between angles a and (i (see Fig. 12-3): 


sin(a) >i 2 
sm(/i) )/, 
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where t] 1 is the refraction index of the medium in which a is defined and t} 2 is the refraction index of the 
medium in which [1 is defined. Although the refraction index of a medium is a function of temperature and 
wavelength, we can often use such values as 1.0 for air, 1.3 for water, and 1.5 for glass. 

Employing the Phong formula to describe the local contribution, we express the intensity 7 of the light 
energy represented by the primary ray as 

/ = /.* a + E / p (* d L, • N + k t { R, • V)*) + U + V, 

1=1 

where the ambient tenri now represents the portion of inter-object reflection that is not accounted for by the 
term k r I r , which describes the reflected contribution as the product of the reflection coefficient k r and the 
intensity 7 r of the reflected ray; the last term kj x describes the transmitted contribution as the product of the 
transmission coefficient A, and the intensity 7, of the transmitted ray. Both k r and k { are in the range of 
[0, 1 .0], 7 r and 7 t are obtained recursively by treating the reflected ray and the transmitted ray as a primary 
ray. In other words, we calculate 7 r by evaluating the above equation at the closest surface the reflected ray 
intersects, in exactly the same way as the primary ray is handled. A pseudo-code description of the basic 
ray-tracing algorithm is as follows: 

rayTrace { ray, depth, color} 

{ 

detennine closest intersection of ray with an object; 
if (no intersection) color = background; 
else { 

color = local contribution; 

if (depth > 1) { 
calculate reflected ray; 
ray Tracef reflected ray, depth- 1, ref_color); 
calculate transmitted ray; 
rayTrace( transmitted ray, depth- 1, trans_color); 
color = color + k r ■ ref_color -f k t ■ trans_color; 

} 

} 

} 

This procedure takes a ray and a depth value that is greater than or equal to 1 as input, and returns a 
color as output (typically an RGB vector, the above formula is used to compute the intensity of each 
individual color component). It treats the given ray as a primary ray and (when depth > 1) makes recursive 
calls to obtain the color (ref_color) of the reflected ray and the color (trans_color) of the transmitted ray. 

Figures 12-4, 12-5, and 12-6 show the results of ray-tracing a scene consisting of three opaque spheres. 
The smaller one in the front is dull whereas the two larger ones in the back are shiny. The first image 
(depth = 1) is obtained using only the primary ray and shadow ray. The second image (depth = 2) depicts 
the effect of adding reflected contribution (one level beyond the primary ray). The third image (depth = 3) 
show's the cumulative effect of two levels of reflected contribution. In Fig. 12-7 (also depth = 3), the dull 
sphere is replaced by a shiny glass sphere (a check-board floor is also added) to illustrate the combined 
effect of all three shading components. 


12.3 PARAMETRIC VECTOR REPRESENTATION OF A RAY 

A ray is not a vector although it may look like one (see Fig. 12-3). The difference between the two is 
that, while a vector is defined by its direction and magnitude, a ray is determined by its direction and 
starting point. 
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Fig. 12-4 



Fig. 12-5 


We may represent a ray in terms of two vectors: s to specify its starting point, and d to describe its 
direction (see Fig. 12-8). These two vectors are used to provide a parametric vector representation for the 
ray: 


r(/) = s + td (0 < t) 


where r(/) denotes a family of vectors. When the tails of these vectors are placed at the origin, their heads 
make up the ray. 
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Fig. 12-6 




Fig. 12-8 Vector representation of a ray. 
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12.4 RAY-SURFACE INTERSECTION 

A critical step in the implementation of ray tracing is to determine whether a ray intersects an object. 
This operation is perfonned for each primary ray (or a ray that is treated as a primary ray in a recursive call) 
and shadow ray. If a primary ray does intersect an object, we need to know exactly where the intersection 
point is. 


Coordinate System Plane 

Intersection of a ray with the xy plane can be determined by solving the following for t (see Fig. 1 2-9): 

s + ?d = .v,I + y, J 

With s = .v 5 I + y 5 J + z s K and d — ,v d I +y d J + z d K, we have 

-v s + tx A = x, 

■ v 8 + tv’d =yt 

z s + tz i = 0 

Wlren z d = 0, the ray is parallel to the plane (no intersection). When z s = 0, the ray originates from the 
plane (no intersection). Otherwise, we calculate t using the third equation 



z a 


If t < 0, the negative extension of the ray intersects the plane. On the other hand, if / > 0, the ray itself 
intersects the plane and the coordinates x t and y, of the intersection point can be calculated from the first 
two equations. 



Fig. 12-9 

Intersection with other coordinate system planes and planes that are parallel to a coordinate system 
plane can be handled similarly (see Probs. 12.9 and 12.30). We can also test to see if the intersection point 
is within a region (see Probs. 12.10, 12.11, and 12.12). 


Arbitrary Plane 

Let n — .v n l +y„J + z n K be the normal vector to an arbitrary plane and P 0 (x 0 ,y 0 , z 0 ) a point on the 
plane (see Fig. 12-10). The vector equation or point-normal equation of the plane is then ( from App. 2, 
Sect. A2.3) 

.v„(x - x 0 ) +y„(y - y 0 ) + z„(z - z 0 ) = 0 

To determine if a ray s + rd (s = x s I + y s J + z s K and d = v d I +y d J + z d K) intersects the plane, we 
substitute x s + 1 r d , y s + rt d . and z s + tz a for x, y, and z, respectively: 

*„(*s + Ah - a- 0 ) +y„(y s + ri'j -y 0 ) + z„(z» + tz d - z 0 ) = 0 
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Solving for t we get 


- V n<-N ~ *o) + YnO’s ~ Vp) + ~ *o) 

Vd +W’d+Vd 


Introducing p = ( x 0 — x s )I + (y 0 — y s )J + (z 0 — z s )K, we can express f in vector form: 


n d 


Wien n d = 0, the ray is parallel to the plane (no intersection). Otherwise, if t < 0, the negative 
extension of the ray intersects the plane; if t — 0, the ray originates from the plane. The ray intersects the 
plane only when t > 0. 

Sometimes we want to further distinguish the two cases of intersection; intersecting the outside (front) 
of a plane versus intersecting its inside (back). The former occurs when n d < 0 whereas the latter occurs 
when n d > 0. 


Sphere 

Consider a sphere of radius R centered at (x c ,y c ,z r ) and an arbitrary point (x,y, z) on the sphere (see 
Fig. 12-11). Define vectors c = x c I +_y c J + z c K and p = xl + vJ + zK. The vector equation for the sphere 
is 

IP — c| =R 

To determine if a ray r(/) = s + /d intersects the sphere, we substitute s + rd and p and square both 
sides: 

|s + fd - c| 2 = R 2 

Since v v — |v| 2 for any vector v (see Prob. A2.3), we have 
|s — c + rd| 2 = (s — c + rd) • (s — c + td) 

= (s — c) • (s - c + rd) + rd • (s — c + fd) (Prob. A1 .20) 

= (s - c) • (s — c) + 2(s — c) fd -|- fd fd (Prob. A 1.1 9) 

= |s — c| 2 + 2f(s — c) d -i- C|d| 2 = R 2 
This is in the form of a quadratic equation in f: 

At 2 + 2Bt + C = 0 
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Fig. 12-11 


where A — |d| 2 , B — (s — c) • d, C = |s — c| 2 — R 2 , and the solution is 


t 


-B±s/B 2 -AC 

A 


with 


B 2 -AC 


< 0 
= 0 
> 0 


no intersection 

ray (or its negative extension) touching sphere 
two (possible) intersection points 


When B 2 — AC > 0 we get two t values: t y and t 2 . If both are less than 0, the negative extension of the ray 
intersects the sphere (no intersection by the ray). If one of them is 0, the ray starts from a point on the 
sphere and intersects the sphere only if the other value is positive. If the two values dilfer in signs, the ray 
originates from inside the sphere and intersects the sphere once. If both are positive, the ray intersects the 
sphere twice (first enters and then exits), and the smaller t value corresponds to the intersection point that is 
closer to the starting point of the ray. 


General Implicit Surface 

Generally, to determine if a ray s + rd (s = x s I + _y s J + z s K and d = x d I +y d J + z d K) intersects a 
surface that is represented by an implicit equation F(x, y, z) = 0, we solve 

F(x s + tx d ,y, + n d , z s + tz A ) = 0 


for 1 (see Probs. 12.16, 12.17, and 12.31). 


12.5 EXECUTION EFFICIENCY 

Ray-tracing is time-consuming largely due to the recursive nature of the algorithm and the demanding 
task of computing ray-surface intersections. Several techniques are designed to improve execution 
efficiency by means of object/scene-dependent deployment of system resources. 
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Adaptive Depth Control 

Ail opaque object (k t = 0) does not transmit light. A dull object (k r = 0) does not produce specular 
reflection. In these extreme cases there is clearly no need to trace the transmitted /reflected ray, since the 
transmitted/reflected contribution is zero regardless of the value of IJl r 

Even in a more general situation, say, k r > 0, the cumulative effect of the reflection coefficients of the 
surfaces along the path of reflection can render the contribution from additional recursive steps negligible. 
For example (see Fig. 12-12), if the reflection coefficients of the objects along the path of reflection are k rl , 
k {2 , and k ri , respectively, the eventual contribution of I ri to the primary ray is k Tl k l . 2 k r2 l li . When 
A rl = k r2 = k ri =0.1, this contribution is 0.00 l/ r3 (negligible in most applications). 



Fig. 12-12 


Using adaptive depth control, the ray-tracer keeps track of the cumulative attenuation along the path of 
reflection (and transmission in a similar manner) and, before working on another reflected ray, compares 
the current value of cumulative attenuation to a present threshold. It continues its recursive execution to the 
required depth only if the cumulative attenuation does not fall below the threshold. For the example in Fig. 
12-12, if k Tl = k r2 = k ti = 0.1 and the threshold is 0.01 , then the ray-tracer will not go beyond the second 
reflected ray because k r] k r2 k r7 , <0.01. 


Bounding Volume Extension 

Since a ray travels along a narrow path in one specific direction, it normally misses most of the objects 
in the scene. This means that an object is more likely to get a “no” answer in a ray-surface intersection test 
than a “yes” answer. The purpose of this technique is to identify objects, especially complex objects, that 
are definitely not intersected by the ray as quickly as possible. 

Each object or group of objects in close spatial proximity is surrounded by a capsule/bounding 
volume (e.g., a sphere, a box, etc. ) that permits a simple intersection check. If the ray does not intersect the 
capsule, it does not intersect the object(s) inside the capsule, and no further testing involving the object(s) 
is necessary'. On the other hand, if the ray does intersect the capsule, further testing involving the enclosed 
object(s) is necessary in order to reach a conclusion (“yes” or “no”). 


Hierarchy of Bounding Volumes 

The bounding volume technique can be extended to a hierarchy of bounding volumes. For example, if 
a monitor and a printer are on top of a desk, we may introduce a large bounding volume encompassing all 
three objects, and three smaller bounding volumes inside the large one for the individual objects, 
respectively. Only if a ray intersects the large bounding volume it is necessary to go to the next level of 
the hierarchy to test against tire three smaller bounding volumes. 
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Bounding volume techniques incur two types of overhead: maintaining bounding volumes and testing 
against bounding volumes. It is often worth the effort when the objects are complex and are not loosely 
distributed across the scene (see Prob. 12.20). 


Spatial Coherence/Spatial Subdivision 

This technique is based on the observation that only objects that are in the vicinity of the path of a ray 
may be intersected by the ray, objects that are away from the path do not have anything to do with the ray 
(and should be ignored in intersection testing). 

We subdivide the scene space into regions (see Fig. 12-13 for a 2D illustration) and associate objects 
with the regions in which they appear. In other words, each region is linked to a list of objects that appear 
in that region. Since the regions are typically regularly spaced and aligned with the coordinate system 
axes/planes, only a minimum amount of computation is necessary to identify the regions that are 
intersected by a given ray (see Prob. 12.21). The actual test for ray -surface intersection is then performed 
only with the objects that appear in the identified regions. 



Fig. 12-13 


Furthermore, we follow the ray from one region to the next and perform intersection tests as we enter 
each region (since an object may be associated with more than one region, care should be taken to avoid 
repeating intersection tests with tire same object). Once an intersection point (closest to the starting point of 
the ray) is identified with the objects in the current region, there is no need to move forward into other 
regions. 

The overhead for spatial subdivision is mainly the need to preprocess the scene in order to assign 
objects to regions. The technique is complementary to the bounding volume approach in that it is quite 
capable of dealing with objects that are scattered around the scene. 


12.6 ANTI-ALIASING 

Ray-tracing depicts a continuous scene by taking discrete samples from the scene. Hence the aliasing 
artifacts associated w'ith scan-conversion (e.g., jagged edges, dropout of small parts/objects) are also 
present in ray-traced images. The following are some anti-aliasing techniques that can be incorporated into 
the ray-tracing process to alleviate the problem. 


Supersampling 

Each pixel is divided into subpixels (see Chap. 3, Fig. 3-27) and a separate primary ray is sent and 
traced through the center of each subpixel. The color/intensity values of the primary rays for the subpixels 
are then combined (averaged) to produce a value for the parent pixel. 
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Adaptive Supersampling 

Supersampling often drastically increases the computational burden of ray tracing. In this approach we 
send one ray through the center of a pixel and four additional rays through its comers. If the five rays return 
similar colors the pixel will not be subdivided (it probably corresponds to a smoothly shaded area in the 
scene). Only if the returned colors are sufficiently different do we subdivide the pixel (it probably covers 
both sides of an edge) into 2x2 subpixels. Each subpixel is then handled in the same way, and the process 
terminates when a preset level of subdivision is reached. 


Stochastic Supersampling 

The effect of supersampling can often be enhanced with stochastic or distributed ray-tracing. In this 
approach we deviate from using the fixed (sub)pixel grid by scattering the rays evenly across the pixel area 
in a random fashion. A typical way to achieve this is to displace each ray from its normal position in the 
grid (see Prob. 12.24). 


1 2.7 ADDITIONAL VISUAL EFFECTS 

Various techniques have been developed to achieve certain desirable visual effects. Some methods 
below are applications of distributed ray-tracing, which means that we scatter rays around with respect to a 
certain parameter in the ray -tracing process (recall that in stochastic supersampling we displace rays from 
their normal position in the pixel grid). 


Environment Mapping 

A shiny (mirror-like) object reflects the surrounding environment. Instead of ray-tracing the three- 
dimensional scene to obtain the global reflection, we may map a picture of the environment onto the object 
(see Fig. 12-14). The object is typically placed in the middle of an enclosure such as a cube, cylinder, or 
sphere, with the environment map attached to the inside of the enclosing surface (facing the object). The 
color of a pixel is then a function (e.g., average by supersampling) of the area in the environment map that 
corresponds to the pixel. 

Clearly this is only an approximation to ray-tracing the three-dimensional scene. The quality of the 
mapped reflection is dependent on the size (relative to the enclosure) and shape of the object. However, 



Fig. 12-14 
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environment mapping is often faster, and it is particularly useful in creating the illusion of a computer- 
generated shiny object amidst a real-world scene (which of course cannot be ray-traced). 


Soft Shadow 

Ray-traced images of scene involving fixed point-lights are characterized by the harsh edges of the 
shadow areas. However real lights are not mathematical points and they cast soft shadows , which consist of 
an umbra area surrounded by a penumbra area (see Fig. 12- 15). 



Fig. 12-15 


In order to produce soft shadows we model a light source by a region, called an area light. The area is 
subdivided into subareas or zones. Shadow rays are then distributed to these zones via random selection 
(with equal probability for each zone or weights that favor zones that correspond to higher intensity 
values). 


Blurry Reflection 

To get a blurry reflection of the surroundings on a glossy (not mirror-like) object, we distribute the 
reflected rays. This can be done by displacing a reflected ray from the position of its mirror reflection by a 
small angle. The distribution of the angle is subject to the same bell-shaped reflectance function that 
governs specular highlights (see Fig. 1 1-6). 


Translucency 

The difference between transparency (specular transmission) and translucency is somewhat analogous 
to the difference between specular reflection and blurry reflection. To achieve the effect of light going 
tlirough translucent material, we distribute the transmitted rays by displacing each from the position of 
specular transmission by a small angle. The distribution of the angle is subject to a transmittance function 
similar to the aforementioned reflectance function. Furthermore, we may attenuate the transmitted intensity 
to account for the loss of energy. 


Motion Blur 

A fast-moving object tends to look blurry in a photograph (especially if the photo was taken with a low 
shutter speed). To mi mi c this phenomenon we distribute rays over time. In other words, we predetermine 
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the path or a series of positions of a moving object based on the characteristics of the movement. We then 
displace the object along its path or between positions as each ray is traced. 


Solved Problems 


12.1 Describe how hidden surface removal and projection are integrated into the ray-tracing process. 

SOLUTION 

Each primary ray plays the role of a projector that maps a surface point P to its image F in the view 
plane. If all primary rays emanate from a view point C we have the effect of perspective projection. If all 
primary rays are parallel to each other we have the effect of parallel projection. Furthermore, if a primary ray 
intersects several object surfaces, only the surface that is the closest to the view plane is chosen to determine 
the color of the corresponding pixel; all others are effectively treated as hidden surfaces and removed (not 
displayed). 

12.2 Name the three components of surface shading and the secondary ray for computing each. 

SOLUTION 

Local contribution (shadow ray), reflected contribution (specularly reflected ray), and transmitted 
contribution (specularly transmitted ray). 

12.3 The refraction index of a medium is the ratio of the speed of light in vacuum to the speed of light in 
that medium. Rewrite Snell’s law' in terms of the speed of light in the two participating media. 

SOLUTION 

Let c be the speed of light in vacuum, c l be the speed of light in the medium whose refraction index is //, , 
and c 2 be the speed of light in the medium whose refraction index is »; 2 . We have = c/c t and i/ 2 = c/c 2 . 
Hence Snell’s law can be written as 

sin(oe) c/c 2 c, 

sin(/?) )/, c/c x c 2 

12.4 What is the difference between a vector and a ray? 

SOLUTION 

A vector is defined by its direction and magnitude, whereas a ray is determined by its direction and 
stinting point. 

12.5 A ray is represented by r(f) = s + /d where s = 21 + J — 3K and d = I + 2K. Find the coordinates 
of the points on the ray that correspond to t = 0, 1, 2.5, and 3, respectively. 


r(0) = (2 + 0)1 + J +• (-3 + 0)K = 21 + J - 3K — (2, 1 , -3) 
r(l) = (2 4- 1)1 + J+ (—3 + 2)K = 31 + J — K — (3, 1, -1) 
r(2.5) = (2 + 2.5)1 + J+(-3 + 5)K = 4.51 +J + 2K— * (4.5. 1,2) 
r(3) = (2 + 3)1 + J + (-3 + 6)K = 51 + J + 3K ->(5.1. 3) 


SOLUTION 
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12.6 Let the viewpoint be at (a, b, c) and the center of a pixel at (.v, y. z). Find vectors s and d to represent 
the corresponding primary ray. 

SOLUTION 


s — u\ T bJ T cK 
d = (x — a)I + (y — b)i + (z — c)K 


12.7 Lines in two-dimensional space can be represented by either the algebraic equation y — nix + b or 
the parametric vector equatiou L(/) = s + td, where — oo < t < +oc. For s = I + J and d = I — J, 
find the equivalent algebraic representation. 

SOLUTION 1 

Since L(/) = s + /d = (1 + t)I + (1 — t)J, we have 

x = 1 4 - 1 and y = 1 — t 

Hence x +y = 2, or y = — x + 2. 

SOLUTION 2 

Find two points on the line 

L(0) = s — 1+ J ->• (I, 1) 

L( I ) = s + d = 21 — >• (2, 0) 

We have (y — 1 )/(x — I ) = (0 — 1 )/(2 — 1 ), i.e., y = — x + 2. 


12.8 Refer to Fig. 12-1 6(a), where d represents the direction of the primary ray, t the direction of the 
transmitted ray, and n the normal vector at the intersection point. Express t in terms of d, n, and the 
two angles a and /?. 



Fig. 12-16 


SOLUTION 

Let d, t, n be unit vectors [see Fig. 12-16(6)]. We have 


m = d + cos(a)n 
|m| = sin(a) 
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and 


t = sin(/?)u m — cos(/?)n 

= S ^^ (d + cos(x)n) — cos(/?)n 
sin(a) 


12.9 Determine if a ray intersects a plane that is parallel to the at plane. 

SOLUTION 

The equation for such a plane is z = c where c is a constant. If a ray does intersect the plane, the 
intersection point is at (x„y„ c). In order to find x, and y, we solve the following for t: 

s -F /d = x,l -Fy,J + rK 

With s = x s I + y s J -(- : S K and d = x d I +- y ri J + z d K, we have 

I N 4- /x d = x, 

,v s + ty d = y, 

A + *-d = c 

When z d = 0, the ray is parallel to the plane (no intersection). When z s = c, the ray originates from the plane 
(no intersection). Otherwise, we calculate t using the third equation 


-a 

If t < 0, the negative extension of the ray intersects the plane. On the other hand, if / > 0, the ray itself 
intersects the plane and the coordinates x, and y, of the intersection point can be calculated from the first two 
equations. 


1 2.1 0 Determine if a ray intersects a rectangular region defined by x mm , x max , y min and y mM in the xy plane. 

SOLUTION 

First determine if the ray intersects the xy plane (see Sect. 12.4 under “Coordinate System Plane’’). If not, 
the ray does not intersect the region. Otherwise, find the coordinates x, and y, of the intersection point. If 
x ni in — x i— Xmax an< J .'min — >/ — .'max ray intersects the region; otherwise it does not. 


1 2.1 1 Determine if a ray intersects a triangular (or convex polygonal) region in the at plane. 

SOLUTION 


First detennine if the ray intersects the xy plane (see Sect. 12.4 under “Coordinate System Plane”). If not, 
the ray does not intersect any region in the plane. Otherwise, find the coordinates x, and y, of the intersection 
point. The point is inside a triangular or convex polygonal region if it is on/inside all bounding edges of the 
region. 

Now focus on just the xy plane (ignore :). To test if the intersection point (x,. v,) is inside or outside an 
edge in the plane, we choose a point on the edge (e.g., one of its endpoints) and define an outward nonnal 
vector n that points to the outside of the edge (see Fig. 12-17). We also use the chosen point to define vector v, 
and the intersection point to define vector v,. We have 


n • (v, - v) 


> 0 
= 0 
< 0 


(x„y,) is outside the edge 
(x,,y,) is on the edge 
(x,,y,) is inside the edge 


(6 < 90°) 
(0 = 90“ ) 
(6 > 90 ) 


12.12 A unit square is placed in the xy plane with one corner at the origin and the diagonal comer on the 
positive y axis [see Fig. 12-1 8(a)]. Determine if a ray emanating from (0, 1,2) going in the direction 
of the negative z axis intersects the square. 
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(a) 


<*> 


Fig. 12-18 


SOLUTION 


From the given condition we have 

s = J + 2K and d = -K 

Since z s = 2 ^ 0, z d = — 1 / 0, and t = —zJz A = 2 > 0, the ray intersects the xy plane (see Sect. 12.4 under 
“Coordinate System Plane”). Using x s = 0, v d = 0, y s = 1, and y d = 0, we get 

x,=0 + 2x0 = 0 and y t = 1 + 2 x 0 = 1 

To determine if the intersection point is inside the square, we may follow the solution for Prob. 12.10. 
Alternatively, we may perform a coordinate transformation from system xyz to x'y'z' in order to align the .v' and 
v' axes with the sides of the square [see Fig. 12-18(6)]. This requires a 45° rotation of the xyz system with 
respect to the z axis (or the origin if we ignore the z dimension). The square is now bounded by x mm = 0, 
x raax = 1 , y mil , = 0, and y max = 1 in the new coordinate system, where the coordinate of the intersection point 
are (see Sect. 4.2) 

/? 

x'j = x, cos(45°) + >•, sin(45 ) = — 

, sfl 

y\ = — Xj sin(45‘) +_v ; cos(45°) = — 

Since 


< * < x n] 


and 


£ v, £ Mr 


the ray does intersect the square. 


12.13 Let n = I + J + 2K be the normal vector of a plane that passes through point P 0 (1,1,0). 
Determine if a ray with s — —21 + J + 2K and d = I — K intersects the plane. 
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SOLUTION 

Refer to Sect. 12.4 under arbitrary plane. Since 

n • d = 1 x 1 + 1 x0 + 2x(— 1) = — 1 <0 
the ray intersects the plane. We introduce 

P = (+i - *,)I + tv 0 - >'»)J + (z 0 “ 

= ( 1 — (— 2))I + ( 1 — 1 )J + (0 — 2)K 
= 31 — 2K 

and compute 

np 1x3 + 1 x0 + 2x (—2) ^ 

n • d — 1 

The intersection point is at (jc, + x d , v s +y d , - s + -a) = ( — L L l). 


12.14 Double check that the intersection point found in Prob. 12.13 is indeed on the given plane. 

SOLUTION 


The vector equation for the plane is 

x„(x - x„) +y„(v - y 0 ) + ~ - 0 ) = 0 

Since 


1(-1 - 1 )+ 1(1 - 1 ) + 2(1 - 0 ) = 0 
the intersection point (—1, 1, 1) is indeed on the plane. 


12.15 Let be a sphere of radius 8 centered at (2,4,1) and S 2 a sphere of radius 10 centered at 
(10, —2, —5). Determine if a ray with s = 2J + 5K and d = 1 — 2K intersects the spheres. 

SOLUTION 


First consider sphere 5) . We have 

,4 = |d| 2 = l 2 + (— 2) 2 = 5 

B = (s - c) ■ d = [(0 - 2)1 + (2 - 4)J + (5 — 1)K] ■ (I - 2K) = -10 
C = |s - c| 2 - R 2 = (— 2) 2 + (— 2) 2 + 4 2 - 8 2 = -40 
Since B 2 — AC = ( — 1 0) 2 — 5( — 40) = 300 > 0, there are two t values: 

10- s/300 


IO+s/300 

h =—^r — = 2(1 + \fy) > o 


and 


h = 


= 2(1 - V3) < 0 


From these we know that the ray starts from inside S { . Now considering sphere S 2 , we have 
A = |d| 2 = l 2 +(-2) 2 = 5 

B = (s - c) • d = [(0 - 10)1 + (2 + 2)J + (5 + 5)K] • (1 - 2K) = -30 
C = |s-c| 2 -R 2 = (-10) 2 + (4) 2 + 10 2 - 10 2 = 116 
Since B 2 — AC = (— 30) 2 — 5 x 116 = 320 > 0, there are also two t values: 


30+ n/320 r- 

= = 6+1 .6\/5 > 0 


30 - s/320 r 

and t A = = 6—1 .6\/5 > 0 


From these we know that the ray intersects S 2 in two different places. By comparing ~ 5.464, l } & 9.578, 
and t 4 s=» 2.422, we see that the ray first enters S 2 , then leaves 5), and finally exits from S 2 (the two spheres 
overlap each other). 
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12.16 The (implicit) equation for a sphere of radius R centered at the origin is xr +y 2 + z 2 — R ? = 0. 
Determine if a ray s + /d intersects the sphere. 

SOLUTION 


Let s = x s I + v s J 4- z s K and d = x d I + v d J +- d K. Substitute x s 4- rx d , v s 4- ty A , and z s 4- fe d forx, v, and 
z, respectively: 

(x s + lx A )~ 4- (y s 4- ty A ) 2 4- (z s + /z d r — R- = 0 

Expand and regroup terms: 

(x d 4- v d 4- Zi)t 2 + 2(x s x d + y s v A 4- z s z A )l 4- (xj + y% 4- zj) — R 2 =0 


or 

|d|Y + 2s • d< 4- |s| 2 - /f 2 =0 

This matches the quadratic equation derived in the text (see Sect. 12.4 under “Sphere”), with the center of the 
sphere being set to (0, 0, 0). 

1 2.1 7 The (implicit) canonical equation for an elliptic paraboloid is x 2 + y 2 - z = 0 (see Chap. 9, Sect. 
9.10, Fig. 9-19). Determine if a ray s 4- fd intersects the paraboloid. 

SOLUTION 


Let s = x s l 4- y s J +z s K and d = x d l 4-y d J 4-z d K. Substitute x s 4- fx d ,y s 4- ty A , and z s 4- fc d for x, y, and 
z, respectively: 


(x $ 4- tt d ) 2 4- (v s 4- ty A )~ — (z s 4- tz A ) — 0 


Expand and regroup terms: 

(4 +>i)‘ 2 4- (2x s x d 4- 2 yy A - z A )t 4 (x^ 4 v 2 - z s ) = 0 
When the ray is parallel to the z axis (i.e., x d = 0 and y d = 0), the equation degenerates into 

—-a' +-^ Tv* — z s = 0 

From this we can find 


where 



I < 0 ray's negative extension intersects paraboloid 
= 0 ray's starting point on paraboloid 
> 0 ray intersects paraboloid 

Otherwise, we can rewrite the quadratic equation as 

At 2 + Bt + C = 0 


where 

A = x d 4 - y d B = 2x s x d 4- 2 v fc v d - z d C = x 2 4 .v 2 - z s 
and the solution for the equation is (note A ^ 0) 

-B ± s/B 2 - 4 AC 
’ ~ 1A 

If B 2 — 4 AC < 0, the ray does not intersect (or touches) the paraboloid. If B 2 — 4 AC = 0 and / > 0, the ray 
intersects (or touches) the paraboloid once. If B 2 — 4 AC > 0, we get two t values; q and U. If q <0 and 
t 2 < 0, the negative extension of the ray intersects the paraboloid (no intersection by the ray). If one of the two 
values is 0, the ray starts from a point on the paraboloid and intersects the paraboloid only if the other value is 
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positive. If /| and l 2 differ in signs, the ray originates from inside the paraboloid (i.e., x? +yj: — z s < 0) and 
intersects the paraboloid once. If both values are positive, the ray intersects the paraboloid twice (first enters 
and then exits), and the smaller value corresponds to the intersection point that is closer to the starting point of 
the ray. 


12.18 Figure 12-19 shows a ray s + fd and a canonical paraboloid (see Prob. 12.17) that is defined in its 
own coordinate system x'y'z’. Describe how to make use of the result in Prob. 12. 1 7 to determine if 
the ray intersects this paraboloid. 





Fig. 12-19 


SOLUTION 

Find the coordinate transformation M from system xyz to x[v'- . Apply M to transform (x,,v 5 , z s ) to 
(.*', / s , z') and define s’ = a' I + y' J + r' K. Apply M to transform the origin (0,0, 0) of xyz to ( 4 , 4 , z 0 ), and 
transform (x d , y d . z d ) to (x d , v d , z d ). Define d = (x d - a'o)I + (y d _ >u)J + (- d — 4>)K-> which describes the 
same vector as d. Now represent the ray by s' 4- /d in the x'y'z coordinate system. Use the solution to Prob. 
12.17 to determine if the ray intersects the paraboloid. 

12.19 Refer to Fig. 12-12. If k rl — 4.3 = 4-3 — 0-3, the threshold is 0.01, and the desired depth is 2, will 
the ray-tracer proceed to work on the third reflected ray? 

SOLUTION 

No, since the desired depth is 2. It w'ould if the desired depth were 3 because 4i = 0.027 > 0.01. 

1 2.20 Describe a scene w'here the hounding volume techniques are definitely not applicable. Explain why. 

SOLUTION 

A number of spheres scattered across the scene. First of all, there is no simpler bounding volume for 
intersection testing. Furthermore, any bounding volume that encompasses several spheres is going to occupy a 
relatively large section of the scene. This brings about a relatively high probability for a ray to intersect the 
bounding volume, resulting in further testing against the enclosed spheres (something the bounding volume is 
supposed to help avoid in the first place). 

12.21 In spatial subdivision we need to identify the regions along the ray’s path. Since the ray exits from 
the current region and enters into the next region at the same point in space, a basic operation to 
guide the ray-tracer from one region to the next is: given the direction d of the ray and its entry 
point P l into the current region, find point P 2 through which the ray exits. Describe the operation 
for a parallelepiped region whose faces are parallel to the coordinate system planes. 
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SOLUTION 

Such a region is bounded by the following planes: 

x tniin x — 

y Tram y Traax 

- 2 min - -max 

Since we are looking for the exit point, only planes that satisfy n d > 0, where n is the plane’s nonnal vector, 
should be considered. The nonnal vectors of the planes listed above are simply — L I, —.1, .), — K, and K, 
respectively. Now we use / J , to be the starting point of the ray. For each chosen plane, say, 2 = z min , the t value 
that represents where the ray intersects the plane is (see Prob. 12.8) 

j ~min ~ ~s 

“d 

(see Prob. 12.22 for similar fonnulae for the other planes). Once all eligible planes have been processed, the 
smallest I value from the calculation represents P 2 . 


12.22 Refer to Prob. 12.21, and write down the formulae for computing t for the other five planes. 

SOLUTION 

The formulae tor planes x = x mm , x = x mxi , y y =y max , and 2 = 2 max are 

t Turn x s f Tnax N ^ Tmin As ^ Tmax As ^ -max ~s 

x d x d I'd y'd — d 

respectively. 


12.23 Consider supersampling and adaptive supersampling. Which one is likely to perform better in 
reducing dropout? 

SOLUTION 

Supersampling is likely to perform better since the adaptive method will not subdivide a pixel if the 
center ray and comer rays return similar colors (but a small object may lie between those rays). 


12.24 How to displace a ray from its nonnal position in the pixel grid? 

SOLUTION 

Let (x,y) be the rays nonnal position in the grid. Define Ax and Ay such that [x — Ax. x + Ax] and 
[v — Ay, y + Ay] encompass the area for the new position (normally the area of the corresponding subpixel). 
Choose two random numbers d x and <5, in the range of [—Ax, Ax] and [—Ay, Ay], respectively. The 
new/ displaced position for the ray is (x + d x ,y + S v ). 


12.25 Illustrate w'hy the relative size of an object affects the quality of environment mapping. 

SOLUTION 

The environment map is obtained by projecting the scene (a pyramid) from viewpoint C [see Fig. 
12-20(a)]. When a small object is placed at C, the mapped reflection approximates the result of ray-tracing 
with the actual pyramid [see Fig. 1 2-20(/>)]. On the other hand, if a relatively large object is placed at C, we 
may see a falsely reflected pyramid on the object when the pyramid is really behind the object [see Fig. 
12 -20(c)]. 
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Fig. 12-20 

12.26 Explain why the shape of an object affects the quality of environment mapping. 

SOLUTION 

A concave object may reflect part of itself. However, since the object is not depicted in the environment 
map, any self-reflection will be missing from the result of environment mapping. 


Supplementary Problems 


12.27 Snell's law is a consequence of Fermat's principle in optics: light travels tram point A to point B along the path 
that takes the shortest time. Let A be in one medium where the speed of light is C|, and B be in another 
medium where the speed of light is c 2 (see Fig. 12-21 where the x axis separates the two media). The path that 
takes the shortest time must be a straight line from A to a point P on the x axis followed by a straight line from 
P to B (since the light ray has to cross the x axis at some point and the shortest path within a medium is a 
straight line). Prove that the following holds for this path: 

sin(a) C] 
sm(/3) c 2 



12.28 The following pseudo-code procedure shows another way to describe the basic ray-tracing algorithm. The 
parameters are identical to those used in the procedure in Sect. 12.2, with the exception that the range for 
depth now includes value 0. Do these two procedures produce identical results when invoked with the same 
call? If yes, are they the same in terms of execution efficiency? 
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rayTrace(ray, depth, color) 

{ 

if (depth < 1) color = black; 
else { 

determine closest intersection of ray with an object; 
if (no intersection) color = background; 
else { 

color = local contribution; 

calculate reflected ray; 

rayTrace(reflected ray, depth- 1, ref_color): 

calculate transmitted ray; 

ray Trace (transmitted ray, depth- 1, trans_color); 

color = color + k r ■ ref_color + A, • trans_color; 

} 

} 

} 

12.29 Refer to Prob. 12.7, and convert y = 2x — 6 to parametric vector representation. 

12.30 Determine if a ray intersects the yz plane. 

12.3 1 The implicit equation for a cylinder of radius R along the j-axis is x 2 +y* — R 2 = 0. Determine if a ray s + fd 
intersects the cylinder. 


1 2.32 Show that the solution to equation At 2 + 2 Bt + C = 0: 

-B±VB 2 - AC 
' ~ A~ 

and the solution to equation At 2 + Bt + C = 0 


t = 


-B ± s/B 2 - 4 AC 
2A 


are essentially the same. 




Mathematics for 
Two-Dimensional 
Computer Graphics 

The key to understanding how geometric objects can be described and manipulated within a computer 
graphics system lies in understanding the interplay between geometry and numbers. While we have an 
innate geometric intuition which enables us to understand verbal descriptions such as line, angle, and 
shape and descriptions of the manipulation of objects (rotating, shifting, distorting, etc.), we also have the 
computer’s ability to manipulate numbers. The problem then is to express our geometric ideas in numeric 
fonn so that the computer can do our bidding. 

A coordinate system provides a framework for translating geometric ideas into numerical expressions. 
We start with our intuitive understanding of the concept of a two-dimensional plane. 

Al.l THE TWO-DIMENSIONAL CARTESIAN COORDINATE SYSTEM 

In a two-dimensional plane, we can pick any point and single it out as a reference point called the 
origin. Through the origin we construct two perpendicular number lines called axes. These are traditionally 
labeled the x axis and the y axis. An orientation or sense of the plane is determined by the positions of the 
positive sides of the x and y axes. If a counterclockwise rotation of 90' J about the origin aligns the positive x 
axis with the positive y axis, the coordinate system is said to have a right-handed orientation [see Fig. 
Al-l(fl)]; otherwise, the coordinate system is called left-handed [see Fig. A 1-1 (/>)]. 

The system of lines perpendicular to the x axis and perpendicular to the y axis forms a rectangular grid 
over the two-dimensional plane. Every point P in the plane lies at the intersection of exactly one line 
perpendicular to the x axis and one line perpendicular to the y axis. The number pair (x, y) associated with 
the point P is called the Cartesian coordinates of P. In this way every point in the plane is assigned a pair 
of coordinates (see Fig. A 1 -2). 

Measuring Distances in Cartesian System 

The distance between any two points P { and P 2 with coordinates (xj ,yy) and (x 2 ,y 2 ) can be found with 

the formula 

D = yJ(x 2 - Xj ) 2 + (y 2 - yi ) 2 
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(a) Right-handed. (b) Left-handed. 


Fig. Al-1 
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Fig. A 1-2 


The length of a line segment can be measured by finding the distance between the endpoints of the 
segment using the formula. 

EXAMPLE 1. The length of the line segment joining points P 0 (-1.2) and F,( 3.5) can be found by 

D = ^/(5 - 2) 2 + [3 - 1)] 2 = v^+42 = 5 


Measuring Angles in Cartesian System 

The angles of a triangle can be measured in terms of the length of the sides of the triangle (see Fig. 
Al-3), by using the Law of Cosines, which is stated as 


c 2 — a 2 + b 2 — 2ni(cos 0) 
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C 



EXAMPLE 2. Refer to Fig. Al-4. To find the angle 6, we use the Law of Cosines: 

5 2 = 4 2 + 2 2 — 2(4)(2)cos0 or cos0 = — ^ so 0=108.21° 

16 

The angle formed by two intersecting lines can be measured by forming a triangle and applying the Law of 
Cosines. 


Describing a Line in Cartesian System 

The line is a basic concept of geometry. In a coordinate system, the description of a line involves an 
equation which enables us to find the coordinates of all those points which make up the line. The fact that a 
line is straight is incorporated in the quantity called the slope m of the line. Here m — tan 0, where 9 is the 
angle fonned by the line and the positive x axis. 

From Fig. Al-5 we see that tan 9 — Ay/ Ax. This gives an alternate formula for the slope: m = Ay/ Ax. 

EXAMPLE 3. The slope of the line passing through the points P () {— 1 , 2) and P t (3, 5) is found by 

Ay = 5 - 2 = 3 Ar = 3 - 1) = 4 

so m = Ay /Ax = |. The angle 6 is found by tan 6 = m = | or 0 = 36.87°. 




Fig. Al-5 


Fig. A 1-6 


276 


APPENDIX 1 


The straightness of a line is expressed by the fact that the slope of the line is the same regardless of which two 
points are used to calculate it. This enables us to find the equation of a line. 


EXAMPLE 4. To find the equation of the line whose slope is 2 and passes through the point P 0 ( 1 , 2), let P(x, v) be 
any point on the line. The slope is the same regardless of which two points are used in calculating it. Using P and P* 
we obtain 


Ay = v — 2 Ax = x — 1 


so 


Av 

m = — or 
Ax 


2 = 


y — 2 


x- 1 


Solving, we have y = 2x (see Fig. A 1-6). 


Every line has an equation which can be put in the form y — nix + b, where ni is the slope of the line 
and the point (0, b) is the y intercept of the line (the point where the line intercepts the y axis). 


Curves and Parametric Equations 

The equation of a curve is a mathematical expression which enables us to determine the coordinates of 
the points that make up the curve. 

The equation of a circle of radius r whose center lies at the point (b, k ) is 

(x-hf + (y-k) 2 =r* 

It is often more convenient to write the equation of a curve in parametric form; that is 

x=f(t) y = g(t ) 

where parameter t might be regarded as representing the “moment” at which the curve arrives at the point 
(x,y). 

The parametric equations of a line can be written in the fonn (Probs. A 1.21 and A 1.23) 

x — at + x 0 y — bt+ y 0 

EXAMPLE 5. The parametric equation of a circle of radius r and center at the origin (0, 0) can be written as 
x = r cos t and y = r sin /, where t lies in the interval 0 < t < 2n. 

A geometric curve consists of an infinite number of points. Thus any plot of such a curve can only approximate 
its real shape. Plotting a curve requires the calculation of the x and y coordinates of a certain number of the points of 
the curve and the placing of these points on the coordinate system. The more points plotted, the better the 
approximation to the actual shape. 

EXAMPLE 6. Plot five points of the equations x = t,y = / 2 for t in the interval [—1, 1]. 


t 

-1 

2 

0 

1 

2 

D 

ES 

-1 

1 

2 

0 

1 

2 


y 

1 

1 

4 

0 

1 

4 

D 


Plotting (x,y) gives Fig. Al-7. We can approximate the actual curve by joining the plotted points by line 
segments. 
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A1.2 THE POLAR COORDINATE SYSTEM 

The Cartesian coordinate system is only one of many schemes for attaching coordinates to the points 
of a plane. Another useful system is the polar coordinate system. To develop it, we pick any point in the 
plane and call it the origin. Through the origin we choose any ray (half-line) as the polar axis. Any point in 
the plane can be located at the intersection of a circle of radius r and a ray from the origin making an angle 
0 with the polar axis (see Fig. A 1-8). 



Fig. A 1-8 
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The polar coordinates of a point are given by the pair (r, 0). The polar coordinates of a point are not 
unique. This is because the addition or subtraction of any multiple of 2n (360° ) to 0 describes the same ray 
as that described by 0. 


Changing Coordinate Systems 

How are the Cartesian coordinates of a point related to the polar coordinates of that point? If (r, 0) are 
the polar coordinates of point P, the Cartesian coordinates (x, y) are given by 

x = r cos 0 y = r sin 0 

Conversely, the polar coordinates of a point whose Cartesian coordinates are known can be found by 

r 2 — x 2 + v 2 0 — arctan- 

x 


A1.3 VECTORS 

Vectors provide a link between geometric reasoning and arithmetic calculation. A vector is represented 
by a family of directed line segments that all have the same length or magnitude. That is, any two line 
segments pointing in the same direction and having the same lengths are considered to be the same vector, 
regardless of their location (see Fig. A 1-9). 



Properties of Vectors 

Vectors have special arithmetic properties: 

1. If A is a vector, then —A is a vector with the same length as A but pointing in the opposite 
direction. 

2. If A is a vector, then kA is a vector whose direction is the same as or opposite that of A, 
depending on the sign of the number k, and whose length is k times the length of A. This is an 
example of scalar multiplication. 

3. Two vectors can be added together to produce a third vector by using the parallelogram method or 
the heud-to-tail method. Thus is an example of vector addition. 

In the parallelogram method, vectors A and B are placed tail to tail. Their sum A + B is the vector 
detennined by the diagonal of the parallelogram formed by the vectors A and B (see Fig. Al-10). 

In the head-to-lail method, the tail of B is placed at the head of A. The vector A -I- B is detennined by 
the line segment pointing from the tail of A to the head of B (see Fig. A 1 - 1 1). 

Both methods of addition are equivalent, but the head-to-tail is easier to use when adding several 
vectors. 
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Fig. Al-10 


Fig. AI-11 


Coordinate Vectors and Components 

In a Cartesian coordinate system, vectors having lengths equal to 1 and pointing in the positive 
direction along the x and y coordinate axes are called the natural coordinate vectors and are designated as I 
and J (see Fig. A 1-12). 



By use of scalar multiplication and vector addition, any vector V can be written as a linear 
combination of the natural coordinate vectors. That is, we can find numbers a and b so that V = 
ol + bi. The numbers [«, b] are called the components of V. The components of a vector can be 
determined from the coordinates of the head and the coordinates of the tail of the vector. If (h x , h v ) and 
(/,, t v ) are the coordinates of the head and the tail, respectively, the components of V are given by 

a = h x — t x b = h v — t v 

Notice that if the tail of V is placed at the origin, the components of the vector are the coordinates of the 
head of V. 

The introduction of components allows us to translate the geometric properties of vectors into 
computational properties. If the vector A has components [x, , Vj] and the vector B has components [x 2 , 
the length of A, denoted as |A|, can be computed by 

|A| = •/■*!+*? 

To perform scalar multiplication by a number c, we have 

cA = exj I + cv'i J 
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and to perform vector addition 


A + B — ()f| + x 2 )I + (V[ + V 2 )J 


EXAMPLE 7. Find the components of the vector A whose tail is at P, ( 1 , 2) and whose head is at P 2 ( 3. 5) (see Fig. 
A 1 - 1 3). To find the components, we shift the tail of A to the origin. The head is at 

x = 3 — 1 =2 v = 5-2 = 3 


Thus A = 21 + 3 J. The length of A is 

|A| = s/2 2 + 3 2 = s/l3 

If B = -31 + 2J, then A + B = (2 - 3)1 + (3 + 2)J = -I + 5J. 




Fig. AI-I4 


The Dot Product 

The dot product A • B is the translation of the Law' of Cosines into the language of vectors. It is defined 
as 

A B = |A||B| cos 0 

w'here 0 is the smaller angle between the vectors A and B (see Fig. A 1-14). If A has components [.v, , v, ] 
and B has components [x 2 ,y 2 ], then A ■ B — x x x 2 +\\y 2 (componentwise multiplication). {Note-, since 
cos 90° = 0, two nonzero vectors A and B are perpendicular if and only if A • B = 0.) 


EXAMPLE 8. To find the angle 0 between the vectors A = 21 -f 3J and B = J, we use the definition of the dot 
product to find 


cos 0 ----- 


A B 

iaIbi 


A • B = (21 + 3J) • (01 + J) = 2 • 0 + 3 • 1 = 3 


|A| = s/^+lfi = v/TJ IB = Vo 2 4- 1 2 = 1 


So 


cos 0 = 


s/U 


and 6 = 33.69 
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A 1.4 MATRICES 

A matrix is a rectangular array or table of numbers, arranged in rows and columns. The notation a ^ is 
used to designate the matrix entry at the intersection of row / with column j (see Fig. A 1-1 5). 


Column j 



Fig. .41-15 


The size or dimension of a matrix is indicated by the notation m x n, where m is the number of rows in 
the matrix and n is the number of columns. 

A matrix can be used as an organizational tool to represent the information content of data in tabular 
form. For example, a polygonal figure can be represented as an ordered array of the coordinates 
of its vertices. The geometric transformations used in computer graphics can also be represented by 
matrices. 


Arithmetic Properties of Matrices 

Examples of these properties are as follows. 

1 . Scalar multiplication. The matrix k\ is the matrix obtained by multiplying every entry of A by the 
number k. 

2. Matrix addition. Two m x n matrices A and B can be added together to form a new m x n matrix 
C whose entries are the sum of the corresponding entries of A and B. That is, 

c ij — a ij + b tJ 


3. Matrix multiplication. An m x p matrix A can be multiplied by a p x n matrix B to fonn an m x n 
matrix C. The entry c (/ is found by taking the dot product of the i row of A with the j column of B 
(see Fig. AT -1 6). So c« = (row /') ■ (column j) = a^b Xj + a i2 b 2 j + • • • + a im b mj . Matrix multi- 
plication is not commutative in general. So AB ^ BA. Matrix multiplication is also called 
matrix concatenation. 


A 


Row i 






B 

Column j 



b v 

•>* 



Fig. 41-16 
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4. Matrix transpose. The transpose of a matrix A is a matrix, denoted as A 7 , formed by exchanging 
the rows and columns of A. If A is an m x n matrix, then A 7 is an n x m matrix. A matrix is said 
to be symmetrical if A = A 7 . 

Two basic properties of the transpose operation are ( 1 ) (A + B) 7 = A 1 + B and (2) (AB) 7 = B r A 7 . 


EXAMPLE 9 


— 2A 


AB 




o 

7 

and B = 

2 3 


l > 2 / 





' [3 • (— 1 )] + (2 - 2) + (5 . l ) 

k [l- (-!)] + [(-!) 2] + (21) 


(3 - 0) + (2 • 3) + (5 • 2) \ 
(1 • 0) + [(-!)• 3] + (2-2)/ 



Matrix Inversion and the Identity Matrix 

Tlie n x n matrix whose entries along the main diagonal are all equal to 1 and all other entries are 0 is 
called the identity matrix and is denoted by I (Fig. A 1-1 7). 


I 


/ 1 0 0 
[ 0 1 0 
0 0 1 


°\ 

0 

0 


Vo o o ••• i/ 


Fig. A I -17 


If A is also an n x n matrix, then AI = IA = A. That is, multiplication by the identity matrix I leaves 
the matrix A unchanged. Therefore, multiplication by the identity matrix is analogous to multiplication of a 
real number by 1 . 

An n x n matrix A is said to be invertible or to have an inverse if there can be found an n x n matrix, 
denoted by A -1 , such that A 1 A = AA 1 = I. The inverse matrix, if there is one, will be unique. 

A basic property of matrix inversion is (AB) -1 = B 1 A 1 . 


EXAMPLE 10 

a-G?) - M-i!) 


Then 


AM-( ] °W 1 °W ll+0( - 2 > 10 + °M-P °\ 

V 2 */ V -2 1 J ~\2 \ + l (- 2 ) 2 0 4-11 y — V 0 1 ) 
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and 


MA = 



OW 1 • 1 +0-2 
1 ) ~ V - 2 -1 + 1 -2 


1 0 + 0- 1 \ / 1 0\ 
— 2 ■ 0 + 1 • 1 / — \ 0 1 ) 


So MA = AM = I. Thus M must be A ' . 


A1.5 FUNCTIONS AND TRANSFORMATIONS 

The concept of a function is at the very heart of mathematics and the application of mathematics as a 
tool for modeling the real world. Stated simply, a function is any process or program which accepts an 
input and produces a unique output according to a definite rule. Although a function is most often regarded 
in mathematical terms, this need not be the case. The concept can be usefully extended to include processes 
described in nonmathematical ways, such as a chemical formula, a recipe or a prescription, and such 
related concepts as a computer subroutine or a program module. All convey the idea of changing an input 
to an output. Some synonyms for the word function are operator, mapping, and transformation. 

The quantities used as input to the function are collectively called the domain of the function. The 
outputs are called the range of the function. Various notations are used to denote functions. 

EXAMPLE 11. Some examples of functions are: 

1. The equation /(x) =x 2 + 2x + 1 is a numerical function whose domain consists of all real numbers and 
whose range consists of all real numbers greater than or equal to 0. 

2. The relationship T(\) = 2V is a transfonnation between vectors. The domain of T is all real vectors, as is 
the range. This function transfonns each vector into a new vector which is twice the original one. 

3. The expression H(x, _v) = (x. — y) represents a mapping between points of the plane. The domain consists of 
all points of the plane, as does the range. Each individual point is mapped to that point which is the reflection 
of the original point about the x axis. 

4. If A is a matrix and X is a column matrix, the column matrix Y found by multiplying A and X can be 
regarded as a function Y = AX. 


Graphs of Functions 

Ifx and v are real numbers (scalars), the graph of a function y — f(x) consists of all points in the plane 
whose coordinates have the fonn [x,/(x)], where x lies in the domain of /. The graph of a function is the 
curve associated with the function, and it consists of an infinite number of points. In practice, plotting the 
graph of a function is done by computing a table of values and plotting the results. This gives an 
approximation to the actual graph off. 

EXAMPLE 12. Plot five points for the function y = x 2 over the interval [—1. 1]. 


X 

-I 

1 

2 

0 


D 

o 

1 

1 

4 

0 

fl 

D 


Plotting the points (x, x 2 ) calculated in the table and joining these points with line segments gives an approximation to 
the actual graph of y = x 2 . See Fig. Al-7 for the plot of the graph. 

The plotting resolution is determined by the number of x values used in plotting the graph. The higher the 
plotting resolution, the better the approximation. 
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Composing Functions 

If the process performed by a function H can be described by the successive steps of first applying a 
function G and then applying a function F to the results of G, we say that H is the composition of F and G. 
We write H = F o G. If the input to the function is denoted by x, the output H(x) is evaluated by 

H(x) = F[G(x )] 

That is, first G operates on x; then the result G(x) is passed to F as input. 

Composition of functions is not commutative in general; that is, F o G f G o F. 

The concept of composition is not restricted to only two functions but extends to any number of 
functions. For functions that are represented by matrices, composition of functions is equivalent to matrix 
multiplication; that is, AoB = AB. 


EXAMPLE 13 

1. If f(x) =x 1 + 2 and g(.r) = 2x+ 1, then /[g(.v)] = [g(x)] 2 + 2 = (2x + l) 2 + 2 = 4.v 2 +Ax + 3. 

2. If 


*-(i 0 - -(1i) 

then 

a—Mj Od 0-(J “) 


The Inverse Function 

The inverse of a function/ (with respect to composition) is a function, denoted by/ -1 , that satisfies 
the relationships/ -1 of — i and / of 1 = /', where i is the identity function i(x) — x. Applying the above 
compositions to an element x, we obtain the equivalent statements: 

r l [f(x)\=x .nr l (x)]=x 

The inverse operator thus “undoes” the work drat / has performed. 

Not every function has an inverse, and it is often very difficult to tell whether a given function has an 
inverse. One must often rely on geometric intuition to establish the inverse of an operator. 

EXAMPLE 14. Let R be the transfonnation which rotates every point in the plane by an angle of 30° (in the 
positive or counterclockwise direction). Then it is clear that 7? -1 is the transfonnation that rotates every point by an 
angle of —30° (a rotation of 30° in the clockwise direction). 


Solved Problems 


Al.l Find the distance between the points whose coordinates are (a) (5,2) and (7,3), ( b ) (—3, 1) and 
(5,2), (c) (—3, —1) and (-5,-2), and (d) (0, 1) and (2,0). 

SOLUTION 

(a) D = yj(l - 5) 2 + (3 - 2) 2 = v/2 2 + l 2 = /5 

(b) D=J [ 5 - (— 3)] 2 + (2 - l) 2 = yV + (l) 2 = /65 
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( C ) D = y/[- 5 - (— 3)] 2 + [-2 - (-1)] 2 = yj(-2f + (-\) 2 = V5 
(d) D=J( 2- 0) 2 + (0 - l) 2 = + (- 1) 2 = s/5 

A1.2 Derive the equation for a straight line (see Fig. A 1 -5). 

SOLUTION 

A straight line never changes direction. We determine the direction of a line by the angle 6 the line makes 
with the positive .v axis. Then at any point P u on the line, the angle fonned by the line and a segment through 
P parallel to the x axis is also equal to 0. Let P u (x 0 ,y Q ) be a point on the line. Then if P(x.y ) represents any 
point on the line, drawing the right triangle with hypotenuse P U P, we find 

tan0=^> 

x-x 0 

The quantity tan 0 is called the slope of the line and its traditionally denoted by m. 

We rewrite the equation as 


x — x 0 Ax 

(The tenn Av is often called the “rise” and At, the “run.”) This can be solved for.v in terms of x. 


A1.3 Write the equation of the line whose slope is 2 and which passes through the point (—1, 2). 

SOLUTION 

Let P(x,y) represent any point on the line. Then 

Ay =y — 2 Ax = x — (— 1 ) = x + 1 
and m = 2. Using Ay /Ax = m, we find 

V — 2 


x+ 1 


-2 or y - 2 = 2(x + I ) = 2x + i 


thus v = 2x + 4. 


A1.4 Write the equation of the line passing through ( 1 , 2) and P 2 ( 3, —2). 

SOLUTION 

Let P(x, v) represent any point on the line. Then using F, , we compute 

Av = v — 2 At = x — I 
To find the slope m, we use P x and P 2 to find 


Then 


Then 


Ay =-2-2 = 

in 

II 

1 

r<-i 

II 

3 

1 

Av _ 

v — 2 

m = ~= -2 

so ' = —2 

Ax 

x — 1 

— 2 = — 2r + 2 

and v = — 2r + 4 


A1.5 Show that lines are parallel if and only if their slopes are equal. 

SOLUTION 

Reter to Fig. A 1 - 1 8. Suppose that lines /, and I 2 are parallel. Then the alternate interior angles 0, and 0 2 
are equal, and so are the slopes tan 11, and tan Q 2 . 
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Fig. AI-18 


y 

3 

U (2 

: 3) i 

P| 




2 

3 

X 

-2 





P 

'2 L( 5 

, -2) 


Fig. Al-19 


Conversely, if the slopes tan 0, and tan 0 2 are equal, so are the alternate interior angles 0, and 0 2 . 
Consequently litres /j and l 2 are parallel. 


A1.6 Let C/(2, 3) and L( 5, —2) be the upper left and lower right comers, respectively, of a rectangle 
whose sides are parallel to the .v and y axes. Find the coordinates of the remaining two vertices. 

SOLUTION 

Referring to Fig. Al-19, we see that the x coordinate of P ] is the same as that of I, namely 5, and they 
coordinate that of U, namely 3. So P ] = (5. 3). Similarly, P 2 = (2, -2). 


A1.7 Plot the points A( 1, 1), B(- 1, 1), and C(-4, 2). Then (a) show that ABC is a right triangle and (b) 
find a fourth point D such that ABCD is a rectangle (see Fig. A 1-20). 



SOLUTION 

(a) Show that the Pythagorean theorem is satisfied: 


AC = slB 2 +BC 2 
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Use the distance formula to compute the lengths of the sides of ABC: 

AB=J[ 1 -(- l )] 2 + [I -(-l)f = v / 2 2 + 2 2 = y8 
BC = v/[-l -(-4)] 2 +(-l -if = -J (3) 2 + (— 3) 2 = v^lS 
AC = J [ I -(-4)] 2 +(l - 2) 2 = y/5 2 -I- (- 1 ) 2 = v^26 


So 


AC 2 = 26 = AB 2 + BC 2 = 8 + 18 


(b) Let the unknown coordinates of D be denoted by (x.y). Use the fact that opposite sides of a rectangle are 
parallel to find .v andy. Since parallel lines have equal slopes, compute the slopes of all four sides: 


Slope AB : 


1 - 1 


-0 

- 1-1 


Slope CD -- 


_ 1 _ o) -3 

Slope BC = _ ^ = — = -1 Slope DA = ^ 

Then, for ABCD to be a rectangle 

Slope CD = slope AB Slope DA = slope BC 
or 


v — 2 ,y— 2 

.v — (— 4) x + 4 

v- 1 


.v — 2 
x + 4 


= 1 


This leads to the equations 


y — 2 =x + 4 


and 


and 


y- 1 
x — 1 


= -1 


v - 1 = -X + 1 


or 


-x + v = 6 


and 


x + v = 2 


Solving, x = —2 and y = 4. 


A1.8 Find the equation of a circle that has radius r and its center at the point (h, k). 

SOLUTION 

Refer to Fig. A 1-21. If P(x, y) is any point lying on the circle, its distance from the center of the circle 
must be equal to r. Using the distance formula to express this mathematically, we have 

D = y)(x — /i) 2 + (y — k) 2 = r 

So (x — hf + (y — k) 2 = r 2 , which is the equation of the circle. 


A1.9 Given any three points, not all lying on a line, find the equation of the circle determined by them. 

SOLUTION 

Refer to Fig. Al-22. Let P\(d ] . b r ), P 2 (a 2 . b 2 ), and P)(ci). A?) be the coordinates of the points. Let r be 
the radius of the circle and (/;, k) the center. Since each point is distance r from the center, then 

(a, - bf + (A, - kf = r 2 
(a 2 — hf + (b 2 — kf = r 1 
(a, - hf + {b y - kf = r 2 
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Fig. Al-21 Fig. Al-22 


This yields, after multiplying and collecting like terms: 

. , /. , a? — 0? bl — b ? 

(a 2 - a x )h+ (b 2 - b x )k = ~ ? + ~ ? 

(a 3 - a 2 )h + (b 3 - b 2 )* = -- - + 3 

These equations can be solved for /; and A- to yield 

h = i [^r(^2 - fr) + ) + ^</>, - a 2 )J 
2 d 

— I -a 3 ) + rfj(a 3 -a,) + <^(a, -a 2 )] 

a= T rf 

Here, dj = a 2 x + b], d\ — aj + b\, d] = a 3 4- b 3 , and d = a x (h 2 - h 3 ) + a 2 (b 3 - h] ) 4- a 3 (b x — b 2 ). Finally, r 
can be found: 


A1.10 Find the equation of the circle passing through the three points .P,(l, 2), P 2 ( 3, 0), and P 3 (0, -4). 

SOLUTION 

As in Prob. A 1 .9, we find 

d\ = a\ 4- b\ = 5 a 2 — a 3 = 3 b 2 — b 3 = 4 

d\ = a 2 + b 2 = 9 a 3 — a] = — 1 b 3 — b x = —6 

= a| +h| = 16 a x — a 2 = — 2 b x — b 2 =2 


d= l(4) + 3(-6)+0(2) = -14 
and 


, — 1 [5(4) + 9( — 6) + 16(2)] 

2 

28 

~ 28 ~ 

, 1 [5(3) + 9(— 1) 4- I6(— 2)] 

_ -26 

A 

28 

28 


Therefore, the center of the circle is located at 

\\4' 14 / 
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and the radius is calculated by 



A1 .1 1 Show that x = r cos t, y = r sin t are the parametric equations of a circle of radius r whose center is 
at the origin. 

SOLUTION 

By Prob. A1 .8 we must show that x 2 -hy 2 = r 2 . Using the trigonometric identity cos 2 t + sin 2 1 = 1, we 
obtain 


x 2 +V 2 = (r cos I ) 2 + (r sin t ) 2 = r 2 cos 2 1 + r 2 sin 2 t = r 2 (cos 2 / + sin 2 /) = r 2 


A 1.1 2 Show that the parametric equations 

a + bt c + dt 

x — — v = — 

e+ft ■ e+ft 

are the equations of a line in the plane. 

SOLUTION 

We show that the slope 

A v _ y 2 — >'] 

Av x 2 — X] 

is a constant, independent of the parameter l. So 

V; - V'l (c + dt 2 )/{e +ft 2 ) ~ (C + +/q ) 

x 2 - x, (a + bt 2 )/(e +ft 2 ) - (a + bt { )/(e +/q) 

ce + det 2 + cft\ + dft 2 t\ — ce — cft 2 — det x — dft 2 t x 

ae + afl x + bet 2 + bft 2 1\ — ae — aft 2 — ebt , — bft 2 t l 

de(t 2 ~ h) ~ cf(t 2 ~ 1 1 ) de-cf 

be(t 2 - h) - af(t 2 - /,) be-af 

So if be — af ^ 0, the slope Av/ Ax is constant, and so this is the equation of a line. 


A1.13 Let the equations of a line be given by (Prob. A 1.1 2) 

\+t 2+t 

x = - v = 

1 - t 1 - t 

Then (a) plot the line for all values of t, ( b ) plot the line segment over the interval [0, 2], and (c) find 
the slope of the fine. 

SOLUTION 

Making a table of values, we have 


t 

-i 

1 

2 

0 

1 

2 

2 

3 

X 

0 

1 

3 

l 

3 

-3 

-2 

y 

i 

l 

2 

5 

-4 



* 
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The resulting line is shown in Fig. A 1-23. 

(a) We observe the following: (1) the line is undefined at t = 1, (2) (x, v) — » (oo, oo) as t — ► I - , (3) 

(x,y) — *■ (— oo, — oo) as t — * 1 + , and (4) (x, y) —► (—1, — 1) as t — *• ±oo (see Fig. Al-23). 

( b ) The interval [0,2] includes the infinite point at t = 1. The corresponding region is the exterior line 
segment between points Pj(l, 2) at t = 0 and P 2 ( — 3. -4) at t = 2 (see Fig. Al-24). 

( c ) From Prob. A1.12, the slope of the fine is, with a = 1, b = 1, c = 2, d = 1, e = 1, and / = — 1. 

A v ( 1 )( 1 ) — (2)( — 1 ) 3 

Ax (1)(1)-(1)(-1) 2 

A1.14 Let A = 21 + 7J, B = -31 + J, and C = I - 2J. Find (a) 2A - B and (b) -3A + 5B - 2C. 

SOLUTION 

Pertbnn the scalar multiplication and then the addition. 

(a) 2A — B = 2(21 4- 7J) - (-31 + J) = (41 + 14J) 4- (31 - J) 

= (4 4- 3)1 4- (14- l)J =71 + 13J 

(b) - 3A + 5B - 2C = -3(21 + 7J) + 5(— 31 + J) - 2(1 - 2J) 

= (-61 - 21 J) + (-151 + 5J) + (-21 + 4J) 

= (—6 — 15 -2)1 + (-21 + 5 + 4)J = —231 — 12J 

A1.15 Find jc and y such that 2.rl + (v — 1)J =yl + (3x + 1 )J. 

SOLUTION 

Since vectors are equal only if their corresponding components are equal, we solve the equations (1) 
2x =y and (2) y — 1 = 3jc + 1. Substituting into equation (2), we have (2x) — 1 = 3x + 1 and —2 = x and 
finally y = 2x = 2(— 2) = —4, sor= -2 and y = —4. 

A1.16 The tail of vector A is located at P(— 1 , 2), and the head is at 0(5. —3). Find the components of A. 

SOLUTION 

Translate vector A so that its tail is at the origin. In this position, the coordinates of the head will be the 
components of A. 

Translating P to the origin is the same as subtracting — 1 from the x component and 2 from the y 
component. Thus the new head of A will be located at point Q x , whose coordinates (,vj , y, ) can be found by 

x, =5 - (-1) = 6 y, = -3 - 2 = -5 

Thus A = 61 — 5J. 

A1.17 Given the vectors A = I + 2J and B = 21 — 3J, find (a) the length, ( b ) the dot product, and (c) the 
angle 0 between the vectors. 

SOLUTION 

(a) |A| = y / l 2 + 2 2 = 75 |B| =v /2 2 +(-3) 2 = +T3 

(/>) A B = (I + 2J) • (21 - 3J) = (1 2) + [2 • (-3)] = 2 - 6 = -4 

(c ) From the definition of the dot product, we can solve for cos 6: 

A B -4 

cos 0 = - = 

I A|| B | S/5-/I3 


So 0= 119.74°. 
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Fig. A 1-23 



Fig. Al-24 
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A1.I8 Find the unit vector U A having the direction of A = 21 - 3J. 

SOLUTION 


A 

Since U A = - it follows that 
|A| 


iai = J2 2 + (— 3) 2 = yn 


and 


Ua 


A 21 -3J 2 

I A | _ y/U “/I3 1 



A1.19 Show that the commutative law' for the dot product 

A B = B A 


holds for any vectors A and B. 

SOLUTION 

Let 

A = £/ 1 I “l - QyJ B = ft, I + by J 
So 

A • B = a, ft, +a 2 b 2 B • A = ft,a, +b 2 a 2 

Comparing both expressions, we see that they are equal. 

At. 20 Show that the distributive law for the dot product 

(A + B)C = AC + BC 


holds for any vectors A, B, and C. 

SOLUTION 

Let 


A=a,I-Fa 2 J B = ft, 1 + ft 2 J C = c,I + c 2 J 


So 


A + B — (n, 4- ft,)I + (a 2 “F ft 2 )J 


and 


(A + B) • C = (a, 4- ft, )c x + (a 2 + b 2 )c 2 = n,c, + ft, c, + a 2 c 2 + b 2 c 2 


On the other hand. 


A • C = a,c, + a 2 c 2 B • C = ft,c, + b 2 c 2 


so 


A • C + B ■ C = a,c, 4- a 2 c 2 4- ft,c, 4- b 2 c 2 
Comparing both expressions, we see that they are equal. 


A1.21 Show that the equation of a line can be determined by specifying a vector V having the direction of 
the line and by a point on the line. 
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SOLUTION 

Suppose that V has components [a. b ] and the point Po(x 0 ,y 0 ) is on the line (see Fig. A I -25). If P(x,y) is 
any point on the line, the vector P 0 P has the same direction as V, and so, by the definition of a vector, it must 
be a (scalar) multiple of V, that is P 0 P = /V. The components of P 0 P are [x — x 0 ,y — y 0 ] and those of t\ are 
i[u, />]. Equating components, we obtain the parametric equations of the Line: 

x — x 0 = ta y — y 0 = lb or x = at + x 0 y = bt +y 0 



Fig. At -25 


The nonparametric form of the equation can be determined by eliminating the parameter t from both 
equations. So 

x~x 0 _ v-y 0 

a b 

Solving for y, we have 

y = ^+(, 0 -^o) 

A1.22 Find the (a) parametric and (/;) nonparametric equation of the line passing through the point 
P 0 ( 1, 2) and parallel to the vector V = 21 + J. 

SOLUTION 

As in Prob. A 1 .2 1 , we find, with a = 2, b = 1 , x 0 = 1 , and y 0 = 2, that (a) x = 2t + 1 , y = M- 2 and (b) 
with b/a = 1 v = \x + (2 - |) = |x + 1. 

A1.23 Find the parametric equation of the line passing through points P x ( 1, 2) and P 2 (4, 1). What is the 
general form of the parametric equation of a line joining points P,(X] , y, ) and P 2 (x 2 ,y 2 )? 

SOLUTION 

Refer to Fig. AI-26. Choosing V = P,P 2 = (4 - 1)1 + (1 — 2) J = 31 — 1J. Then as in Prob. A 1 .2 1 . 
x = 3t — 1 and y = —t — 2. In the general case, the direction vector V is chosen, as above, to be 
P, P 2 = (x 2 — X| )I + (v *2 — V] )J. The equation of the line is then 

X = X, + (x 2 - X] )t y = y, + (y 2 - y, )/ 

A1.24 Find the number c such that the vector A = I + cJ is orthogonal to B = 21 — J. 

SOLUTION 

Two nonzero vectors are orthogonal (perpendicular) if and only if their dot product is zero. So 
A • B = (1 + cJ) • (21 — J) = (1 • 2) + [c(— 1)] = 2 — c 
So A and B are orthogonal if 2 — c = 0 or c = 2. 
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A1.25 Compute: 

» (i o 

<*> o nn: ;) 

« <0 4 o 

SOLUTION 


(a) Adding corresponding entries, we obtain 

(5 4 1 \ / 2 -I 3\_/5 + 2 4-1 l+3\ _ (l 3 4 \ 

\0 -1 7 ) \2 0 1 / ~ \ 0 + 2 -1+0 7 + 1 / ~ \2 -1 8 ) 


(b) Since the matrices are of different sizes, we cannot add them. 

(c) Multiplying each entry by 3, we have 


3 


4 

-1 



3 

21 


) 


A1.26 Let 


A = 




Find 2A - 3B. 

SOLUTION 


First multiply, and then add: 


2A-3B = 2 (3 J) + (^ 


-15 
9 


21 \ / 6 — 15 

6 ) ~ [ 0 - 9 


4 + 21 \ _ / — 9 25 \ 

2 + 6 ) ~ V-9 8J 


A1.27 Determine the size of the following matrix multiplications A • B, where the sizes of A and B are 
given as (a) (3 x 5), (5 x 2); ( b ) (1 x 2), (3 x 1); (c) (2 x 2), (2 x 1); and (cl) (2 x 2), (2 x 2). 
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SOLUTION 

(a) (3 x 2); ( b ) undefined, since the column size of A (2) and the row size of B (3) are not equal; (c) 
(2x1); (d) (2x2). 


A1.28 Find the sizes of A and B so that AB and BA can both be computed. Show that, if both A and B are 
square matrices of the same size, both AB and BA are defined. 

SOLUTION 

Let the size of A be (m x n) and the size of B be (r x s). Then AB is defined only if r = n. Also, BA is 
defined only if.v = m. Thus, if A is (m x n), then B must be (n x m). If A is square, say, (n x n), and B is also 
(n x n), then both AB and BA are defined. 


A 1.29 Given 


A = 



find A' . 

SOLUTION 

Exchanging the rows and columns of A, we obtain 


A t = 


-a ; !) 


A1.30 Compute AB for 


(a) A 


(b) A 


( c ) A 


-( 
-( 
= ( 


) 


and 


and 


and 


B 


=(- 4 0 


B = 


B 


-4 

7 


(-4 5 9 \ 

^ 7 6 10 ) 


SOLUTION 

(a) Since A is (2 x 2) and B is (2 x 1), then AB is (2 x 1): 




( b ) 




2 -(-4) + 3 -7 2-5 + 
4) + 2 -7 1-5 + 


3 • 6 \ / 13 28 \ 

2-6/ “V 10 17 j 


(c) A • B 


(2 3 \ / — 4 5 9\ _ / 2 • (—4) -t- 3 • 7 2-5+3-6 2-9 + 3 10\ 
_ \1 2jV 7 6 loj “ V 1 -(-4) + 2-7 1 -5+2-6 1-9 + 2-loJ 


/ 13 28 48 \ 
“ \ 10 17 29 ) 
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A1.31 Let 



and 


B = 


6 2 1 
3 5 8 


Find («) AB and (b) BA 

SOLUTION 

'3 

(а) AB = | 5 

(б) BA 

A1.32 Find the inverse of A 
SOLUTION 

We wish to find a matrix 



3 • 2 + 2 • 5 
5-2 + 65 
2-2+1 - 5 


3 + 2 • 5 + I - 2 
3 + 5-5 + 8 2 


6 • 2 + 2 - 6 + 1 
3 • 2 + 5 • 6 + 8 




" ("r J) 


so that 


Multiplying, we have 


a 2)(: :)-(! ?) 

/ p + 2r q + Is \ / 1 0 \ 

\3p + 4r 3q + 4sj~ \0 1/ 


So p + 2r = 1 , q + 2s = 0, 3p + 4r = 0, and 3<7 + 4,s = 1. Solving the first and third equations we find 
p = —2, r — §. Solving the second and fourth equations gives 9 = 1 and s = — j. So 


k-i 


-d -i) 


A1.33 Let G be the function which multiplies a given vector by 2 and F be the function that adds the 

vector b to a given vector. Find (a) F + G, (b) F oG, (c) G o F, (d) F l , and (e) G~ ] . 

SOLUTION 

If v is any vector, the functions F and G operate on v as F(\) = 2v and G(v) = v + b. 

1 . (F + G)(v) = F(\) + G(v) = (2v) + (v + b) = 3v + b. 

2. (F o G)(v) = F[G(v)] = 2[G(v)] = 2[v + b] = 2v + 2b. 

3. (Go F)(\) = G[F(v)J = [F(y)] + b = 2v + b. 

4. We can guess that 1 (v) = jV. To check this, we set F -, [F(v)] = ^[F(v)] = i[2v] = v and 
F[F- 1 (v)] = 2[F- , (v)] = 2[(i)v] = v. 

5. We can verify that G -I (v) = v— b: G -l [G(v)| = G _1 (v + b) = (v + b) — b = v and G[G _1 (v)] = 
G“'(v) + b = (v — b) + b = v. 


A1.34 Show that A o B = AB for any two matrices (that can be multiplied together). 

SOLUTION 


The tenns A c B and AB produce the same effect on any column matrix X, i.e., (A o B)(X) = ABX. 
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Recall that any matrix function A(X) is defined by A(X) = AX. So 

(A o B)(X) = A[B( X)J = A(BX) = ABX 


A1 .35 Given that A is a 2 x 2 matrix and b is a vector, show that the function F(X) = AX + b, called an 
affine transformation, can be considered as either a transformation between vectors or as a mapping 
between points of the plane. 

SOLUTION 


Suppose that 



and b has components [ri, , b 2 ]. If X is a vector with components [x, , x 2 ], then 


AX = 


( «ii 

V "21 



can be identified with the vector having components [a u x x + a n x 2 .a 2 i- v i + a 22 x i]- And so AX + b is a 
vector. 

If X = (v | , x 2 ) is a point of the plane, then as a point mapping, F(X) = \f { {X),f 2 {X)\, where the 
coordinate functions /, and f 2 are 


J] ( .V) — a 1 1 .v j -f- a j 2 _v 2 -f - b ' and / 2 ( A ) — n 2 ^Xj -4- a 22 x 2 b 2 


A1.36 Show that for any 2x2 matrix A and any vector b the transformation F(X) = AX + b transforms 
lines into lines. 


SOLUTION 


Let x = at + x 0 and v = bt -f y 0 be the parametric equations of a line. With X = (x.y) then 

AX= / a n at + x 0 ) ( a u at + a u x 0 + a l2 bt + a l2 y 0 \ 

\ Ch] a 22 ) \ bt “h.Po / \ a 2 \at + (1 2x Xq + d 22 bt + d 22 )>0 J 


F(X) = AX + b = 


/ t(a u a + a n b) + (a n x„ + d\zVo + ^i) \ 

\ K a 2\ a + a 2lb) + ( a 21*0 + a 22 v 0 + b 2 ) J 


This can be recognized as the parametric equation of a line (Prob. A1.21) passing through the point with 
coordinates (a u x 0 -fa 12 yo + b x , a 21 x 0 + a 2 2 . v 'o + b 2 ) and having the direction of the vector v with compo- 
nents [a xx a + a\ 2 b, a 2X a + a 22 b]. 


A1.37 Show that the transformation F(X) = AX + b transforms a line passing through points F, and P 2 
into a line passing through F(P, ) and F(P 2 ). 

SOLUTION 

As in Prob. A 1.23, the parametric equation of the line passing through P, and P 2 can be written as 
x = x, + (x 2 - x, )/ .v = v, -I- ( v 2 - y, )t 

As in Prob. A 1.36 with a =x 2 — Xj and b =y 2 — y 2 , we find that F transforms this line into another line. 
Now when t = 0, this line passes through the point 

(fl]|X| y a ■ 2 A] + b x , a 2 \X x +fl 2 2yi "b ^ 2 ) = F(Pi) 
and when t = 1 , it passes through the point 

(a u a + a u b + n n x, + a X2 y x + b x , a 2X a + a 12 b + a 2x x x + a 22 y x + b 2 ) = F(P 2 ) 
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A2.1 THREE-DIMENSIONAL CARTESIAN COORDINATES 

The three-dimensional Cartesian (rectangular) coordinate system consists of a reference point, called 
the origin, and three mutually perpendicular lines passing through the origin. These mutually perpendi- 
cular lines are taken to be number lines and are labeled the x, y, and z coordinate axes. The labels are 
placed on the positive ends of the axes (see Fig. A2-1). 



Fig. A2-1 


Orientation 

The labeling of the x, y, and z axes is arbitrary. However, any labeling falls into one of two 
classificatioas, called right- and left-handed orientation. The orientation is determined by the right-hand 

rule. 
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The Right-Hand Rule 

A labeling of the axes is a right-handed orientation if whenever the fingers of the right hand are 
aligned with the positive x axis and are then rotated (through the smaller angle) toward the positive v axis, 
then the thumb of the right-hand points in the direction of the positive z axis. Otherwise, the orientation is a 
left-handed orientation (see Fig. A2-2). 




( b ) Left-handed orientation. 


Fig. A2-2 

Cartesian Coordinates of Points in Three-dimensional Space 

Any point P in three-dimensional space can have coordinates (x,y,z) associated with it as follows: 

1 . Let the x coordinate be the directed distance that P is above or below the yz plane. 

2. Let tire y coordinate be the directed distance that P is above or below the xz plane. 

3. Let the z coordinate be the directed distance that P is above or below the xy plane. 

See Fig. A2-3. 



Fig. A2-3 
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Distance Formula 

If fyUoAo' -o) and P,(x,, Vj.z,) are any two points in space, the distance D between these points is 
given by the distance formula: 

D - yj (x, - x 0 )“ + (Vj - v 0 )- + (z, - Z 0 ) : 

A2.2 CURVES AND SURFACES IN THREE-DIMENSIONS 
Curves 

A three-dimensional curve is an object in space that has direction only, much like a thread (see Fig. 
A2-4). A curve is specified by an equation (or group of equations) that has only one free (independent) 
variable or parameter, and the x, y, and z coordinates of any point on the curve are determined by this free 
variable or parameter. There are two types of curve description, nonparametric and parametric. 



Fig. A2-4 

1 . Nonparametric curve description. 

(a) Explicit form. The equation for curve C are given in terms of a variable, say, x, as 

C: y =f(x) z - g(x) 

That is, y and z can be calculated explicitly in terms of x. Any point P on the curve has 
coordinates P[x,/(x), g(x)j. 

(b) Implicit form. The equations of the curve are F(x,y, z) = 0 and G(x, y, z) = 0. Here,y and z 
must be solved in terms of x. 

2. Parametric curve description. The three equations for determining the coordinates of any point on 
the curve are given in terms of an independent parameter, say, t, in a parameter range [a, b ], which 
may be infinite: 

x=m 

C: V = g(t), 
z = h(t ) 


a < t < b 
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Any point P on the curve has coordinates [/(/), g(t), h(t)]. 

• Equations of a straight tine. The equations of a line L determined by two points P 0 (x 0 ,.Vo. arR i 

P ] (x, , vj , z\ ) are given by: 


nonparametric form 


V = ni\X + Z>, 
L: 


z — m 2 x + b 2 


/ >i -.Vq \ x + / >VAi 
Vi~x 0 ) V *i-*o / 

2,j-Zo \ y + / ZpX i ~2|.V 0 \ 
*1 - *0/ \ Xi ~ X 0 / 


parametric form 

x = x 0 + (x, - x 0 )f v = V 0 + (V, - y 0 )t z = z 0 + (z, - z 0 )f 

Note that when t = 0, then x = x 0 , y = _v 0 , and z = z 0 . Wlien t = 1 , then x = X] , y = y x , and 
z = Z[ . Thus, when the par amete r t is restricted to the range 0 < t < 1 , the parametric equations 
describe the line segment P Q P X . 


Surfaces 

A surface in three-dimensional space is an object that has breadth and width, much like a piece of cloth 
(see Fig. A2-5). 



Fig. A2-5 


A surface is specified by an equation (or group of equations) that has two free (or independent) 
variables or parameters. There are two types of surface description, nonparametric and parametrics. 

1 . Nonparametric surface description 

(a) Explicit form. The z coordinate of any point on the surface S is given in tenns of two free 
variables x and y, that is, z — f(x,y). Any point P on the surface has coordinates 
[x, v,/(x, v)]. 
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(h) Implicit form. The equation of the surface is given in the fonn F(x, y, z) = 0. Here, z is to be 
solved in terms of x and y. There is no restriction as to which variables are free. The 
convention is to represent z in terms of .v and v, but nothing disallows a representation of.\- in 
tenns of v and z or y in terms of x and z. 

2. Parametric description. The three equations for determining the coordinates of any point on the 
surface S are described in terms of parameters, say, s and t, and in parameter ranges [a. b ] and 
[c, d], which may be infinite: 

x =f(s , f), a < s < b 

S : y — g(s, t), c <t <d 

z — h(s , /) 

The coordinates of any point P on the surface have the form [/(s, t). g(s , t ), h(s. /■)]. 

• Equations of a plane. The equation of a plane can be written in explicit fonn as z = ax + by + c 
or in implicit fonn as Ax + By-\- Cz + D — 0 (see Prob. A2.8). The equation of a plane is linear 
in the variables x, v, and z. A plane divides three-dimensional space into two separate regions. 
The implicit fonn of the equation of a plane can be used to detennine whether two points are 
on the same or opposite sides of the plane. Given the implicit equation of the plane 
Ax + By + Cz + D = 0, let f(x, y, z) — Ax -f By + Cz + D. The two sides of the plane 7? + , R 
are detennined by the sign of f(x,y,z); that is, point P{x 0 ,y 0 ,z 0 ) lies in region R + if 
f(x 0 , v’o, z n ) > 0 and in region R f(x 0 ,y 0 ,z 0 ) < 0. If/(v 0 ,v 0 , z ( >) = 0, the point lies on the 
plane. The equations x = 0, y = 0, and z = 0 represent the yz, xz, and xy planes, respectively. 

• Quadric surfaces. Quadric surfaces have the (implicit) fonn Ax 2 + By 2 + Cz 1 + Dxy + Exz + 
Fyz + Gx + Hy + lz + J = 0. The basic quadric surfaces are described in Chap. 9. 

• Cylinder surfaces. In two dimensions, the equation y =f(x) represents a (planar) curve in the xy 
plane. In three dimensions, the equation y =f(x) is a surface. That is, the variables x and z are 
free. This type of surface is called a cylinder surface (see Fig. A2- 6). 



EXAMPLE 1. The equation x 1 +.v 2 = 1 is a circle in the xy plane. However, in three dimensions, it represents a 
cylinder (see Fig. A2-7). 
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Fig. A2-7 


A2.3 VECTORS IN THREE-DIMENSIONS 

The definition of a vector and the concepts of magnitude, scalar multiplication, and vector addition are 
completely analogous to the two-dimensional case in App. 1 . 

In three-dimensions, there are three natural coordinate vectors I, J, and K. These vectors are unit 
vectors (magnitude 1 ) having the direction of the positive x, y, and z axes, respectively. Any vector V can 
be resolved into components in terms of I, J, and K: V = al + hJ + cK. 

The components [a, b, c ] of vectors V are also the Cartesian coordinates of the head of the vector V' 
when the tail of V is placed at the origin of the Cartesian coordinate system (see Fig. A2-8). 



Fig. A2-8 
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EXAMPLE 2. Let / > 0 (x 0 ,.Vo> z o) and P \ (*i . >'i . z \ ) be two points in space. The directed line segment P 0 P , defines a 
vector whose tail is at P 0 and head is at P,. 

TO find the components of P t ,P\ , we must translate so that the tail P 0 is placed at the origin. The head of the 
vector will then be at the point (x t — x () , Vj — >'o. -i — zo)- The components of P 0 P\ are then 

P»P\ ~ (-*1 — + (Vi ~ Vo)J + (2| — z 0 )K 

Vector addition and scalar multiplication can be performed componentwise, as in App. I . The magnitude of a vector 
V, |V|, is given by the formula 


|V| = s/a 1 + b 1 + c 2 


For any vector V, a unit vector (magnitude 1) U v having the direction of V can be written as 


U v = 


V 

Tvi 


The Dot and the Cross Product 

Let V[ = a x \ + b ] J + CjK and V 2 = a 2 I + Z> 2 .J + c 2 K be two vectors. 

The dot or scalar product of two vectors is defined geometrically as Vj • V 2 = | Vj ||V 2 cos 0, where 0 
is the smaller angle between V, and V 2 (when the vectors are placed tail to tail). The component form of 
the dot product can be shown to be 


V, • V 2 =a ] a 2 + b ] b 2 +c ] c 2 


Note that the dot product of two vectors is a number and the order of the dot product is immaterial: 
Vj • V 2 — V 2 • V, . This formula enables us to calculate the angle 6 between two vectors from the formula 

r y V, -V 2 a x a 2 +b l b 2 +c,c 2 

“ IV, ||V 2 | " >2 + b\ + +b\+c\ 

Note that two vectors are perpendicular (orthogonal) (i.e., 6 = 90°) if and only if their dot product 
V] • V 2 = 0. This provides a rapid test for determining whether two vectors are perpendicular. (Equiva- 
lently, we say that two vectors are parallel if they are scalar multiples of each other, i.e, Vj = k\ 2 for some 
number k.) 

The cross product of two vectors, denoted V j x V 2 , produces a new vector defined geometrically as 
follows: Vj x V 2 is a vector whose magnitude is |Vj x V?| = |V X ||V 2 | sin 9, where 0 is the angle between 
V! and V 2 and whose direction is determined by the right-hand rule: V! x V 2 is a vector perpendicular to 
both V j and V 2 and whose direction is that of the thumb of the right hand when the fingers are aligned with 
V j and rotated toward V 2 through the smaller angle (see Fig. A2-9). 

From this definition, we see that the order in which the cross product is performed is relevant. In fact: 

V, xV, = -(V 2 xV,) 


Note also that V x V = 0 for any vector V, since 0 — 0°. The component form for the cross product can be 
calculated as a determinant as follows: 


I 

J 

K 

"1 


Cl 

a 2 

b 2 

c 2 


h 



C 2 


a 2 





a, bi 

a 2 b 


— ( b { c 2 - b 2 c. x )\ + (c,a 2 - c 2 a ,)J + (a x b 2 - a 2 b 1 ) K 


EXAMPLE 3. For a right-handed Cartesian coordinate system, we have I x J = K, JxK = I, IxK = - J. 
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Fig. A2-9 


The Vector Equation of a Line 

A line L in space is determined by its direction and a point / J 0 (x 0 , y 0 , z 0 ) that the line passes through. If 
the direction is specified by a vector V = al + hJ + cK and if P(x, _v, z) is any point on the line, the 
direction of the vector P 0 P determined by the points P 0 , and P is parallel to the vector V (see Fig. A2-10). 
Thus, P 0 P = t\ for some number t. 



Fig. A2-10 
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In component form, we find that (x — .v 0 )I + (y — y 0 )J + (z - z 0 )K = ta\ + tbi + fcK. Comparison of 
components leads to the parametric equations: 

x—XQ + at y — Vq + bt z — z 0 + ct 

In Probs. A2.5 and A2.6 it is shown how the equations of a line are determined when given two points 
on the line. 


The Vector Equation of a Plane 

A vector N is said to be a normal vector to a given plane if N is perpendicular to any vector V which 
lies on the plane; that is, N • V — 0 for any V in the plane (see Fig. A2-1 1 ). A plane is uniquely determined 
by specifying a point P 0 (x 0 ,y’o,z 0 ) that is on the plane and a normal vector N = «jl + n 2 J + « 3 K. Let 
P(x,y, z) be any point on the plane. Then the vector P (I P lies on the plane. Therefore, N is perpendicular to 
it. So N • = 0. 



In component fonn, we obtain 


[«ll + + w 3 K 1 ' K* — *o)l + (v — >’o)J + (z — z 0 )K] = 0 


or 


«i(* - *o) + « 2 0 -To) + ”s( 2 - z 0 ) = 0 

The equation of a plane can also be detennined by specifying (1 ) two vectors and a point (Prob. A2.1 0) and 
(2) three points (Prob. A2. 1 1 ). Using vector notation, we can write the distance D from a point P(x, y, z) to 
a plane as 


[m,(.y - .v 0 ) + n 2 (y - vq) + n y (z - z (l )] 
y/nj +n\+n\ 


or 


\X_ I’oPI 

|N| 


where N = «, I + n 2 J + w 3 K is a nonnal vector to die plane and P 0 (.y 0 ,.Vo. z 0 ) is a pobt on tlie plane (Prob. 
A2.13). 
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A2.4 HOMOGENEOUS COORDINATES 
The Two-dimensional Projective Plane 

The projective plane was introduced by geometers in order to study the geometric relationships of 
figures under perspective transformations. 

The two-dimensional projective plane P, is defined as follows. 

In three-dimensional Cartesian space, consider the set of all lines through the origin and the set of all 
planes through the origin. In the projective plane, a line through the origin is called a point of the projective 
plane, while a plane through the origin is called a line of the projective plane. 

To see why this is “natural” from the point of view of a perspective projection, consider the 
perspective projection onto the plane z = 1 using the origin as the center of projection. Then a line through 
the origin projects onto a point of the plane z = I , while a plane through the origin projects onto a line in 
the plane z = 1 (Fig. A2-12). 



In this projection, lines through points (.v, y, 0) in the plane project to infini ty. This leads to the notion 
of ideal points, discussed later. 


Homogeneous Coordinates of Points and Lines of the Projective Plane 

If (a, b, c) is any point in Cartesian three-dimensional space, this point determines a line through the 
origin whose equations are 


x = at 

y — bt (where t is a number) 
z = ct 

That is, any other point (at, bt, ct) determines the same line. So two points (a, , b ] . Cj ) and (a 2 , b 2 , c 2 ), are 
on the same line through the origin if there is a number t so that 

b 2 = b^t 


a 2 = a \t 


c 2 - c { t 


( A2.1 ) 
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We say that two triples, (a , , h 1 , cq ) and (a 2 . b 2 , c 2 ), are equivalent (i.e., define the same line through the 
origin) if there is some number t so that the equations (A2.1) hold. We write (a l , b l , c, ) ~ (a 2 , b 2 , c 2 ). The 
equivalence classes of all triples equivalent to (a, b, c), written as [a, b, c ], are the points of the projective 
plane. Any representative (cq, b ly <q) equivalent to (a, b, c) is called the homogeneous coordinate of the 
point [«, b, c\ in the projective plane. 

The points of the form (a, b, 0) are called ideal points of the projective plane. This arises from the fact 
that lines in the plane z — 0 project to infinity. In a similar manner, any plane through the origin has an 
equation n x x + ny + n \ z — 0- Note that any multiple kn x x + kny + kn 2 z = 0 defines the same plane. 

Any triple of numbers (« 1 ,» 2 ,«j) defines a plane through the origin. Two triples are equivalent, 
(«i , n 2 , n 3 ) ~ (d x . d 2 , d 3 ) (i.e., define the same plane), if there is a number k so that d x = kn , , d 2 = kn 2 , 
and d 2 = Cm,. The equivalence classes of all triples, [«j, n 2 , « 3 ], are the lines of the projective plane. Any 
representative (d x , d 2 , r/,) of the equivalence class [«, , n 2 , « 3 ] is called the homogeneous line coordinate of 
this line in the projective plane. 

The ambiguity of whether a triple (a, b, c) represents a point or a line of the projection plane is 
exploited as the Duality Principle of Projective Geometry. If the context is not clear, one usually writes 
(a, b, c) to indicate a (projective) point and [a, b, c] to indicate a (projective) line. 


Correlation between Homogeneous and Cartesian Coordinates 

If (x l ,y l ,z l ), Z[ yt 0 are the homogeneous coordinates of a point of the projective plane, the equations 
x = Ay /z, and y — y l jz x define a correspondence between points P x (*] , V] , z l ) of the projective plane and 
points P(x,y) of the Cartesian plane. 

There is no Cartesian point corresponding to the ideal point (x, , jq , 0). However, it is convenient to 
consider it as defining an infinitely distant point. 

Also, any Cartesian point P(x,y) corresponds to a projective point Pixy.x^ , Zj) w'hose homogeneous 
coordinates are xq =x, _vq =y, and z y = 1. This correspondence between Cartesian coordinates and 
homogeneous coordinates is exploited when using matrices to represent graphics transformations. Tire use 
of homogeneous coordinates allows the translation transformation and the perspective projection 
transformation to be represented by matrices (Chaps. 6 and 7). 

To conform to the use of homogeneous coordinates, 2x2 matrices representing transformations of the 
plane can be augmented to use homogeneous coordinates as follows: 



Finally, note that even though w'e have a correspondence between the points of the projective plane and 
those of the Cartesian plane, the projective plane and the Cartesian plane have different topological 
properties which must be taken into account in work with homogeneous coordinates in advanced 
applications. 


Three-dimensional Projective Plane and Homogeneous Coordinates 

Everything stated about the two-dimensional projective plane and homogeneous coordinates may be 
generalized to the three-dimensional case. For example, if P 1 (xr l ,y 1 ,z 1 , wq) are the homogeneous 
coordinates of a point in the three-dimensional projective plane, the corresponding three-dimensional 
Cartesian point P(x, y, z) is, for wq ^ 0, 

X = *L v = h . z = — 

wq ' wq wq 
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In addition, if P(x, y, z) is a Cartesian point, it corresponds to the projective point P(x,y, z, 1). Finally, 
3x3 matrices can be augmented to use homogeneous coordinates: 


( t V 

0 \ 

{3x3] 

0 

\ J 

0 

\0 0 0 

• / 


Solved Problems 


A2.1 Describe the space curve whose parametric equations are x = cost, y = sin r, and z — t. 

SOLUTION 

Noting that x 2 +y 2 = cos 2 t + sin 2 / = I (see Fig. A2-13), we find that the x, y variables lie on a unit 
circle, while the z coordinate varies. The curve is a (cylindrical) spiral. 



A2.2 Find the equation of a sphere of radius r centered at the origin (0,0,0). 

SOLUTION 

Let P(x, _v,z) be any point on the sphere. Then the distance D between this point and the center of the 
sphere is equal to the length of the radius r. The distance formula yields 

7(x - 0) 2 + (v - 0) 2 + (z - 0) 2 = r or x 2 +.v 2 + z 2 = r 2 
This is the (implicit) equation of the sphere. 


A2.3 Show that V • V = |V| 2 for any vector V. 

SOLUTION 

If V = al + bJ + cK, then 

V • V = (al + />J + cK) • (al + b J + cK) = a 1 + b 1 + c 2 = |V | 2 


A2.4 Let Vj = 21 — J + K and V 2 = I + J — K. Find (a) the angle between V, and V 2 , ( b ) a vector 
perpendicular to both V! and V 2 , and (c) a unit vector perpendicular to both V', and V 2 . 
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SOLUTION 

(a) We use the formula 


cos 6 : 


V. -Vt 

|V,||V 2 | 


Now 


and 


IV, 


/2 2 + (-l) 2 +(l) 2 = I V 2 1 = \/l 2 + l 2 + (— l) 2 = >/3 


V, • V 2 = (2)(l) + (— 1)(1) + (!)(— 1) = 0 
Thus cos 6 = 0, and so 9 = 90°. So the vectors are perpendicular. 

( b ) The vector V| x V, is perpendicular to both V, and V 2 . So 


I J K 

V, x V 2 = 2 -1 1 

I 1 1 


-i i : 
i i 


i - 


2 1 

I 1 


J + 


2 -1 

I 1 


K = -21 - J + 3K 


(c) Since V, x V, is perpendicular to both V , and V 2 , we find a unit vector having the direction of V , x V 2 . 
This is 


u v> 


Vi xV 2 

IV, x V 2 | 


From part (b), we have 


So 


IV, x V 2 | = -Ji—l) 1 + (— l) 2 + (3) 2 - n/T4 


_ -2 1 3 

Uv ' xv ’- _ 7n 1_ 7tt J + 7Tt K 


A2.5 Find the equation of the line passing through two points P„(x 0 , _v 0 , z 0 ) and P,(.y, ,v,, z, ). 

SOLUTION 

To find the equation of a line, we need to know a point on the line and a vector having the direction of the 
line. The vector detennined by P 0 and P\, PoP, clearly has the direction of the line (see Fig. A2-1 4), and point 
P 0 lies on the line, so with direction vector 

P«P, = (*1 - *o)' + L V I ->o) J + U| - -o) K 
and point P () (x 0 ,y 0 , z n ), the equation is 

x = x 0 + (a:, - x 0 )t y= y 0 + (.v, — v 0 )r z = z 0 + (x, - 

A2.6 Find the equation of the line passing through P^l, —5, 2) and P,( 6, 7, -3). 

SOLUTION 

From Prob. A2.5, the direction vector is 

P^P7 = (6- 1 )I + [7 — (— 5)JJ + (—3 — 2)K = 51 + 12J-5K. 

Using point P 0 (L — 5, 2), we have x = I + 5/, y = — 5 + 12t, and z = 2 — 5l. 

A2.7 Let line segment L } be detennined by points P,(«, , b x ,c x ) and P 2 (fl 2 , b 2 , c 2 ). Let line segment L 2 
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L 



Fig. A2-14 


be determined by points Q\(u x , *?, , nq) and Q 2 {u 2 , v 2 , m' 2 ). How can we determine whether the line 
segments intersect? 

SOLUTION 

The parametric equations of L, are (Prob. A2.5) 

x = a t + (a 2 — fli)s 
y = b, + (b 2 - b,)s 
Z = Cj + (c 2 -c,)s 

The equations of L 2 are 

X = M] + (u 2 - u, )t 

y=Vj +(v 2 -v,)/ 
z = w 1 +(w 2 - w,)t 

Equating, we find 

(« 2 — Uj)l — (a 2 — a x )s = a t — Uj 
(v 2 - u, )/ - (b 2 -h\)s = b\- a, 

(M' 2 - W, )t - (C 2 - C, )S = Cj - w, 

Using the first two equations, we solve for s and t: 

t _ (h ] - D] )(n 2 - tii) - («i - uQ{b 2 - bj) 

(a 2 - «,)( v 2 - v,) - (b 2 - b,)(u 2 - «,) 

(bj - - «i) - - «i)(»2 - H) 

s — 

(a 2 - fli)(t> 2 - »,) - (b 2 - b x )(u 2 - Mj) 

We now substitute the s value into equation L } and the I value into equation L 2 . If all three corresponding 
numbers x, y, and z are the same, the lines intersect; if not, the lines do not intersect. Next, if both 0 < s 
and 0 < t < 1, the intersection point is on the line segments L l and L 2 , between I\ and P 2 and Q x and 

A2.8 Show that the equation of a plane has the implicit form Ax + By + Cz + D — 0, where A. B. and C 
are the components of the normal vector. 


tO IA 
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SOLUTION 

The equation of a plane with normal vector N = /l I + B.i + CK and passing through a point 
P<>(*o>.v<»-o) is 

A(x — x 0 ) + B{ v — v () ) + C(z — z 0 ) = 0 or Ax + By 4 - Cz + (—Ax 0 - By 0 — Cz 0 ) = 0 

Calling the quantity D = (— Ax 0 — By u — Cz 0 ) yields the equation of the plane: 

Ax + By + Cz + D = 0 


A2.9 Given the plane 5x — 3_y + 6z = 7: (a) find the normal vector to the plane, and (b) determine 
whether P ] (\, 5, 2) and Pxi— 3, -1,2) are on the same side of the plane. 

SOLUTION 

Write the equation in implicit form as 5x — 3.v + 6z — 7 = 0. 

(a) From Prob. A2.8, the coefficients 5, —3, and 6 are the components of a normal vector, that is, 
N = 5I-3J + 6K. 

( b ) Let /( x, y, z) = 5.r — 3 y + 6z — 7. The plane has two sides, R~ where/ (x, y, z) is positive and R~ where 
/(x,y,z) is negative. Now for point P/l. 5, 2), we have 

/(l, 5,2) = 5(1) — 3(5) + 6(2) — 7 = — 5 

and for point P 2 (~ 3, —1,2), 

/(— 3, -1,2) = 5(— 3) - 3(— 1) + 6(2) - 7 = -7 
Since both /(l, 5, 2) and/(— 3, — 1, 2) are negative, I\ and P 2 are on the same side of the plane. 


A2.10 Find the equation of a plane passing through the point P 0 (l, —l, 1) and containing the vectors 
Vj = I - J + K and V 2 = -1+ J + 2K (see Fig. A2-15). 



SOLUTION 

To find the equation of a plane, we need to find a normal vector perpendicular to the plane. Since V, and 
V 2 are to lie on the plane, the cross product V, x V, perpendicular to both Vi and V 2 can be chosen to be the 
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normal vector N (see Fig. A2-15). So 


N = V, x V- 


I J K 
1 -1 1 

-1 12 


-1 2 


-31 - 3J +0K 


So with N = — 31 — 3J and the point P 0 ( 1,-1, 1 ), the equation of the plane is 

-3(x- 1) — 3[>- — (—!)] + 0(z- 1) = 0 or — 3 jc — 3>- = 0 

Finally, x+y = 0 is the equation of the plane. This is an example of a cylinder surface, since z is a free 
variable and y = —x. 


A2.ll Find the equation of the plane determined by the three points Poll, 5, -7), P\(2, 6. I ), and 
P 2 ( 0, 1, 2) (see Fig. A2-16). 



Fig. A2-16 


SOLUTION 

To find the equation of a plane, we must know a point on the plane and a normal vector perpendicular to 
the plane. 

To find the normal vector, we observe that the vectors P„Pi and P 0 P 2 lie on the plane, and so the cross 
product will be a vector perpendicular to both these vectors and so would be our choice for the nonnal vector; 
that is, 

N = P 0 i\xP 0 i i ; 

Now 

Pp*7 = (2- 1)1 + (6 - 5)J + (1 — (— 7))K = I + J + 8K 
and 

P^ = (0 - 1)1 + (1 - 5)J + (2 - (— 7))K = —I - 4J + 9K 
So 
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So N = 411 — 1 7 J — 3K, and with point P 0 (l. 5. —7), the equation of the plane is 

41(x — 1) — 17(y — 5) — 3[z — (—7)] = 0 or 41.v - 17v - 3z + 23 = 0 


A2.12 Show that the equation of the plane that has x, y, and z intercepts A(a, 0, 0), B(Q. h. 0), and 
C(0, 0, c), respectively, is (see Fig. A2-17) x/a +y/b + zfc — 1. 



Fig. A2-17 


SOLUTION 

As in Prob. A2. 1 1, we form the vectors AB = — al 4- bJ and AC = — ol + cK. The nonnal vector to the 
plane is then 


N = AB x AC = 


1 

—a 

—a 


J 

b 

0 


K 

0 

c 



°ll- 

~ a 

J "f 

—a 

b 

<•1 

—a c | 

—a 

0 


K = bc\ + acJ + abK 


The equation of the plane with this nonnal vector and passing through A(a. 0, 0) is 

bc(x — a) + uc(y — 0) + ab(z — 0) = 0 or hex + acy + abz = abc 
Dividing both sides by abc, we have x/a +y/b + z/c = 1 . 


A2.13 Find the distance from a point (jt) , y t , Zj) to a given plane (see Fig. A2-18). 

SOLUTION 

Let N = «, 1 4- n 2 J + n i K be the nonnal vector to the plane, and let P 0 (x„,y 0 , z 0 ) be any point on the 
plane. The equation of the plane is 


«[(x - *o) + n 2 (y - y 0 ) + « 3 (- - =o) = 0 


The distance D from P t (x ( , yj , z, ) to the plane is measured along the perp>endicular or nonnal to the plane. 
Let L n be the line through P { (x t , V] , Zj ) and having the direction of the nonnal vector N. The equation of 
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L w is 

X = JC] + H| t 

L n : v = V] + n 2 t 
z=z, + n 3 l 

We first find the intersection point Pj(x l7 y I? Zj) of the line Z v with the plane. The distance from the point 
P\ (x | , Vj . z, ) to the plane will be the same as the distance from the point P, , V] . 2 ] ) to the intersection point 
P,(x,.y,,z l ). 

Substituting the equations of the line L N into the equation of the plane, we find 


«l(*l + n x t - x„) 4- n 2 {y x 4- n 2 t —y 0 ) + n 3 (z 3 + n 3 t - z u ) = 0 


Solving for /, we have 




n i( x i ~ x o) + « 2 (Vi — Vo) + «i(-i - -o) 


«1 + «2 + n 3 

Calling this ntunber l h we find that the coordinates of P t are 

x, = x, 4 n x t, v, = v, + n 2 t, z, = z, 4 n 3 t, 
The distance D from P(x x ,y 3 ,z 3 ) to P 2 (x l7 y j7 z t ) is 

D = y (xj — x 1 ) i + (vy — V] )' + (-/ — 2i )' 

From equation (A2.2), we obtain 


(A2.2) 


x,-x x = «, 1 1 y l - y, = n 2 t , z, - z, = n 3 t, 

Substitution into the fonnula for D yields 

D = yjdhlj) 2 + (ihtj) 2 + (n 3 tj) 2 = \t[\yjn\ + n\ 4n| 

or, substituting for 

g _ |n,(x, -x 0 ) + » 2 CV| ~ Vq) + »3(*i ~ 2 o)I 
s/n\ + n\ + n\ 
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Fig. A2-19 


To find the number k, we use the tact that V p lies on the plane, so N is perpendicular to V„, i.e., V p ■ N = 0. 
So 

V • N V • N , 

0 = V„ - N = V ■ N - A-(N ■ N) or k=—— = =- (since N • N = |N| 2 ) 

p N • N |N| 2 

Then 

/V ■ N\ 

Vp=v -(w r WJ) 


A2.15 Let a plane be detennined by the normal vector N = I — J + K and a point P 0 ( 2, 3 — 1). 
(a) Find the distance from point A* , ( 5 . 2, 7) to the plane. 
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( b ) Let V = 21 + 3J — K be a vector. Find the projection of V p (in the direction of the nonnal) onto the 
plane. 

SOLUTION 


(a) The vector P 0 P, = 31— J + 8K. From Prob. A2.13 we have 


IN-PoPil 

|N| 


1(0(3) + (— 1)( — 1) + U)(8)| = 12 = 4 ^ 

^/(l) 2 +(-l) 2 + (l) 2 ^ 


(/>) From Prob. A2.14, the projection vector V p is given by 



Now 


So 


V • N _ (2)( 1) + (3)(— 1) + (—!)(!) _ —2 
|N | 2 (l) 2 +(-l ) 2 + (l ) 2 3 

V p = (21 + 3J - K) - (- |)(I - J + K) 

= (21 + 3J - K) - (— |I + §J -|K) = f 1+2 J - lK 


A2.16 Given vectors A and B that are placed tail to tail we define the perpendicular projection of A onto B 
to be the vector V shown in Fig. A2-20. Find a formula for computing V from A and B. 



SOLUTION 

We first find (see Fig. A2-20) 

Using the unit vector 


|V| = |A| cos(0) = | A | 


A B A B 


I A|| B| |B| 


V = |V|U v 

Since V and B have the same direction, we have U v = U B . Hence 

ABB A B 

v = |V|,; -=isn5i=TiF l 


A2.17 Let (3, 1,-3) be the coordinate of point A. Find a point B on the line y — 2x in the xy plane such 
that the line connecting A and B is perpendicular to y = 2.v. 
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SOLUTION 1 


Since point B is on y = 2x, it has coordinates (x, 2x, 0). We introduce two vectors V, and V 2 (see Fig. 
A2-21): 

V, = xl + 2xJ 

V 2 = (3 - x)I + ( 1 - 2x)J - 3K 
The line connecting A and B is perpendicular to v = 2x if 

Vi . V 2 = 0 


or 


x(3 — x) 4- 2x( 1 — 2x) = 0 

This yields x(l — x) = 0. Since the angle between the line y = lx and the line from the origin to point A is 
verifiably not 90 , we have x A 0. Hence x = 1, and the coordinates of point B are (1,2,0). 



SOLUTION 2 

Referring to Fig. A2-21 , let A be a vector whose tail is at the origin and whose head is at point A. From 
Prob. A2.16, we can see that V) is simply the perpendicular projection of A onto V, itself. Since 
A = 31 -F J — 3K and V, = xl + 2xJ, the projection V = V 2 is given by 


\ ■ Vi 3 Y -4- ^ Y 

V, = -r\j = — (xl + 2xJ) = I + 2 J 

1 |V,1 2 1 x 2 + 4x 2 


This means that the coordinates of point B are (1,2,0). 


A2.18 Let a = |A| and h = |B|. Show that the vector 


a B + bA 
a + b 


bisects the angle between A and B. 

SOLUTION 1 


oB -)- b . A a/>U B + baU A 
a+b a+b 


ab 

a + b 


(U B + L a ) 


Since vector C is in the direction of the diagonal line of the diamond figure formed by the two unit vectors U A 
and U B (see Fig. A2-22), it bisects the angle between U A and U B , which is also the angle between A and B. 
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SOLUTION 2 

Let a be the angle between A and C, /i be the angle between B and C, and c = |C|. We have 

aB bA 


cos(a) = 


A C a + b a ^ ® + bA A A B + ba 

ac ac ac(a -f b) c(a 4- b) 


and 


B 


aB + M 


,,, B ■ C a 4-h uB B + 4A B ab + A ■ B 

COS (i) = — = u-ru _ — _ _ — _ — — 

be be bc(a + b ) c(a + b) 

Comparing the two expressions we get cos(a) = cos(fi), or a = fi. 



(Prob. A 1.20) 
(Prob. A 1.1 9) 


A2.19 Prove the formula V, • V 2 = |Vj ||V 2 | cos (8), where V, and V 2 are two vectors and 6 is the smaller 
angle between Vj and V 2 (when the vectors are placed tail to tail). 

SOLUTION 

Since V • V = |V| 2 for any vector V (see Prob. A2.3), we have (see Fig. A 2- 23): 

IV, -V 2 | 2 = (V, - V 2 ) • (V, - V 2 ) 

= V , .(V, -V 2 )-V 2 -(V, -V 2 ) 

= V, ■V,-2V, -V 2 +V 2 -V 2 
= |V,| 2 — 2V, -V 2 + |V 2 | 2 

On the other hand, using the Law of Cosines (see Sect. A 1.1), we have 

IV, - V 2 | 2 = |V,| 2 + |V 2 | 2 - 2|V,||V 2 |cos(0) 

Comparing the two expressions we get V, • V 2 = |V, ||V 2 | cos(d). 


/ v,-v, 


A2.20 Use vectors to show that, if the two diagonals of a rectangle area perpendicular to each other, the 
rectangle is a square. 

SOLUTION 

Let the lower left corner of the rectangle be at the origin and the upper right corner be at (x, y) or (x, r, 0). 
The two diagonals of the rectangle can be expressed as V, = xl + y J and V 2 = xl — yJ. When the two 
diagonals are perpendicular to each other, we have xx — yy = 0, or x =y. Hence the rectangle is a square. 
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A2.2 1 (a) What three-dimensional line determines the homogeneous coordinate point ( 1 , 5, — 1 )? (b) Do 

the homogeneous coordinates (1,5, — 1) and (—2, —10, —3) represent the same projective point? 

SOLUTION 

( a ) The line passes through the origin (0,0,0) and the Cartesian point (1,5, —1). So x = t,y= 5i, and 
z = —I is the equation of the line. 

(h) The homogeneous coordinates represent the same projective point if and only if the coordinates are 
proportional, i.e., there is some number t so that —2 = (1)/, — 10 = (5)/, and —3 = (— I )t. Since there is 
no such number, these coordinates represent different projective points. 



Answers to Supplementary Problems 


Chapter 2 

2.42 No, since there is a change in aspect ratio (5/3.5 ^ 6/4). 

2.43 Yes, since 5.25/3.5 =6/4=1 .5. 

2.44 Present the image at an aspect ratio that is lower than the original. 

2.45 int i,j, c, rg/>[3j; 

for ( / = 0: j < height; j++) 
for (/' = 0; / < width; /++) { 
getPixel(/, ./', rgb): 

c = 0.299*rgb[0] T- 0.5S7*rgb[ I ] + 0.144*rg/>[2]; 
setPixel(/, j, c); 

} 


Chapter 3 
3.35 



3.36 1 . Compute the initial values. Prior to passing the variables to the line plotting routine, we exhange x and y 

coordinates, (x, v) giving (y, x). 

dx = y, - y 2 Inc, = 2 dy _ 

dy = x, — x 2 Inc 2 = 2 (dy — dx) ' 


2. Set (x, y) equal to the lower left-hand endpoint and x cnd equal to the largest value of x. If dx < 0, then 
y = x 2 , x = y 2 , x end = yj . If dx > 0, then y = x, , x = y, , x end = y 2 . 

3. Plot a point at the current (v,x) coordinates. Note the coordinate values are exchanged before they are 
passed to the plot routine. 

4. Test to determine whether the entire line has been drawn. If x = x end , stop. 

5. Compute the location of the next pixel. If d < 0, then </ = </ + lnc x . If d > 0, then d = d + Inc 2 , 
y =y + 1. 

6. Increment x: x = x + 1 . 

7. Prior to plotting, the (x,y) coordinates are again exchanged. Plot a point at the current (x,y) coordinates. 

8. Go to step 4. 


3.37 1. Set the initial values: (xj , y t ) = start of line; (x 3 ,y 3 ) = end of line; a = tan 1 ((y 3 — V] )/(x 3 — x, )); 

d = length of dash; c = length of blank. 

2. Test to see whether the entire line has been drawn. If x t > x 3 , stop. 
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3. Compute end of dash: 

x 2 = Jt| + rfcos(a) 
v 2 = V] + d sin(a) 

4. Send (x, , V| ) and (.v 2 , y 2 ) to the line routine and plot dash. 

5. Compute the starting point of the next dash: 

.V| = x 2 + c cos(a) 

Vi =y 2 4- r sin(a) 

6. Go to step 2. 

3.38 See Fig. S-l. Solving for 0 = n/4\ 

x = 2 cos(7t/4) + 0 = 1.414 y = 1 sin(7t/4) + 0 = 0.7071 

Solving for 0 = 37r/4: 

x = 2 cos(3tt/ 4) 4- 0 = — 1 .4 14 v = 1 sin(3?r/4) + 0 = 0.707 1 
Solving for 0 = 5n/4: 

x = 2cos(57t/4) + 0 = — 1.414 y = 1 sin(57t/4) 4- 0 = —0.7071 
Solving for 0 — 7n/4: 

x = 2cos(77t/4) 4- 0 = 1 .414 v = 1 sin(77t/4) 4- 0 = —0.7071 

y 


(- 1 . 414 , 0 . 


(- 1 . 414 , - 0 . 7071 ) 


Fig. S-l 

3.39 (a) Step 3 should be changed to read 

x = a cos(fl) — b sin^O 4- 4- h 

V = b sin(0) 4- a cos^d 4- ^ 4- k 




ANSWERS TO SUPPLEMENTARY PROBLEMS 


323 


3.40 


3.41 


3.42 


3.43 


(b) Step 3 should be changed to read 

x = a cos(0) — b sin^P + ^ + /; 
y = />sin(0) + acos(6 l 2 3 4 5 6 7 8 + ^ +k 

(c) Step 3 should be changed to read 

x = a cos(0) — b sin^fl + ^ + h 
y = b sin(0) + a cos^Q ^ + k 

Note that rotating an ellipse n/2 requires only that the major and minor axes be interchanged. Therefore, the 
rotation could also be accomplished by changing step 3 to read 

x = h cos 6 y = a sm(fl) 

1. Set initial variables: a = radius, (h, k) = coordinates of sector center, 0\ = starting angle, 0 2 = ending 
angle, and i = step size. 

2. Plot line from sector center to coordinates of start of arc: plot (/;, k) to (a cos(0 , ) + It, a sinl/i, ) + k). 

3. Plot line from sector center to coordinates of end of arc: plot (//. k) to (a cos(d 2 ) + It, a sin (6 2 ) + //). 

4. Plot arc. 

When a region is to be filled with a pattern, the fill algorithm must look at a table containing the pattern before 
filling each pixel. The correct value for the pixel is taken from the table and placed in the pixel examined by 
the fill algorithm. 

The human brain tends to compensate for deficiencies in models. For example, although the cube shown in 
Fig. S-2 is lacking the visual cue, convergence, it is perceived as a cube. When the choice of aliasing is 
inconsistent, the brain either eannot decode the model or can decode it only with difficulty because there is no 
one rule that can be learned to compensate for the inconsistencies of the models. 



1 . Initialize the edge list. For each nonhorizontal edge, find 1 /m(= Ax/Av), y lnax , .y mill , and the x coordinate 
of the edge's lower endpoint. 

2. Begin with the first scan line y. 

3. If y is beyond the last scan line, stop. 

4. Activate all edges with y min —y and delete all edges for which y > y max . 

5. Sort the intersection points by x value. 

6. Fill the pixels between and including each pair of intersection points. 

7. Increment x by 1 /m for each active edge. 

8. Increment y by 1 and go to step 3. 
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3.44 Overstrike can be eliminated by checking each pixel before writing to it. If the pixel has already been written 
to, no point will be written. Or better yet, design scan-conversion algorithms that do not result in overstrike. 


Chapter 4 

4.19 

o _ ( cos 0 —sin# \ . a _ ( cos(— #) —sin (— #) \ / cos# sin#\ 

0 ~ \ sin# cos # ) lU1 ~~ \ sin(— #) cos (— #) ) — \ —sin# cos # ) 

Also 

( cos 0 —sin # \ / cos # sin # \ 
sin # cos # / V — sin # cos # / 

( (cos 2 # + sin 2 #) (cos # sin # — sin # cos #) 

(sin # cos 0 — cos # sin #) (sin" # 4- cos 2 #) 

Therefore, R g and R_ e are inverse, so R_ e = Rjj 1 . In other words, the inverse of a rotation by # degrees is a 
rotation in the opposite direction. 

4.20 Magnification and reduction can be achieved by a uniform scaling of s units in both the X and Y directions. If 
s > I , the scaling produces magnification. If ,v < 1 , the result is a reduction. The transformation can be written 
as 

(x, v) i— *• (sx, sy) 

In matrix fonn, this becomes 

(» 0CM“) 

(a) Choosing s = 2 and applying the transformation to the coordinates of the points A, B, C yields the new 
coordinates /('(O. 0), B'( 2, 2), C’(10, 4). 

(b) Here, s = | and the new coordinates are A'\ 0, 0), B"(j , j), C"(f , 1 ). 

4.21 The line y = x has slope 1 and y intercept (0, 0). If point P has coordinates (x. y), then following Prob. 4.10 we 
have 

Mi ‘ />= (l i)(y) = (x) ° r M L (x,y) = <y,x) 


4.22 The rotation matrix is 

(* o 

2 2 

= V2 v/2 

2 2 

\ 0 0 1 




The translation matrix is 


1 0 1 
0 1 0 
0 0 1 
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4.24 We express the general form of an equation in the xV coordinate system as F(x'.y') = 0. Writing the 
coordinate transformation in equation form as 

V = q(x,v) 3/ = r(x,y) 

and substituting this into the expression for /■’, we get 

F(q(x.y), r(x,y)) = 0 

which is an equation in xy coordinates. 


4.25 


SOLUTION 1 

Let 

We have 

and 

Since 

we also have 

Therefore 

SOLUTION 2 

Let 


v \ = L, I + y J and V 2 = t x l + t y , J 
Ty ■ T ’y{x,y) = f,.- (X + L,,y 4- /,,,) : = (x + L, + . y + fo, + f V| ) 

TV, • TV, (x.y) = 7V, (x + y , y + y) = (x + y + L, .J + L, + V, ) 

y + TV = (L, + L 2 )I + (L, + L, ) J 

T’i' + f 2 (x.v) = (x + y + y •>' 4- t V] + y ) 

Ty, ■ Ty 2 = Ty 2 ■ T Vi = T Vf+ r 4 


V, =t x \ + t y 3 and V 2 = t x l + yJ 
and express the translation transformations in matrix form 



0 y \ 



o y\ 

o 

t-ir 

1 V, 

and 

24 

II 

O 

1 y 

\o 

0 1 J 


\o 

0 1 / 


we have 


1 0 F 


0 L 


1 0 ‘x 2 + V, 


Ty x T Vl =\ 0 1 y| |0 1 y | = | 0 1 y+y 


oo i / \o o i 


o o 


and 


I 0 L 


0 t. 


T v Ty. = | 0 1 I I 0 I u 0 


oo i / V o o i 



1 0 


0 0 


V, + y \ 


Also since 


e, + f 2 = tv, + y )i + (y + y)J 
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we have 

/ 1 0 t Xi + A, \ 

r i'i+L = I 0 1 'y, + ‘n 

Vo 0 1 / 

Therefore 

TV, • = TV, ■ T ( , = r K)+f / 2 


4.26 


4.27 


SOLUTION 1 

Since 

^.6 • S c ,j( x ' y) = S a b (cx , dy) = (acx, bdy) 

s c ,d ■ Sa.b&y) = $:.<<(«*• *>’) = ( cax, dby ) 

S acM (x,y) = (acx, bdy) 
we have S a b • S cd = S cd ■ S a h = S acM . 

SOLUTION 2 


Express the scaling transformations in matrix form 


s *.»“(o !)• 


Since 


and 


S ^-(o ")• 

s -» s -=(; °) ( o 


”)(o 

we have S ab ■ S c d = S c d - S a b = S uc M . 

Express the rotation transformations in matrix form 


we have 


-sin(a) \ 

and 

cos(a) y 

— sin(a) \ 

/ cos(/f) 

cos(a) / 

V sin(/?) 


/ cos(a) cos(/i) — sin(a) sin(/i) 
V sin(a) cos(/i) -f cos(a) sin (/?) 



0\ _ lac 0 \ 

d ) ~ V 0 bd ) 


0 \_( c <i 0\ 

b ) ~ \ 0 db ) 


r, _ ( cos (/?) — sin(/i) \ 

11 V sin(/?) cos(^) / 

— sin(/J) \ 
cos (P) ) 

— cos(a) sin(/l) — sin(a) cos(/J) 
— sin(a) sin(/i) +- cos(a) cos(/J) 


and 


/ cos(/i) — sin(/i) \ /cos(a) — sin(a) \ 

V sin(J?) cos(P) ) \ sin(a) cos(a) / 

( cos (/i) cos(a) — sin(/i) sin(a) — cos(/i) sin(a) —sin (/i) cos(a) \ 

sin(/l) cos(x) -+- cos(/f) sin(a) — sin(/i) sin(a) + cos(jS) cos(a) / 


Using trigonometric identities 


cos(a 4- /i) = cos(a) cos(/?) — sin(a) sin(/i) 
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and 


we have 


sin(a 4 ft) = sin(a)cos(/f) 4 cos(a) sin(/i) 


R tt -Rf = Rp-R a = 


/ cos(a 4- /i) 
y sin(a + J?) 


-sin(a 4 /?) \ _ „ 
cos(a 4 fi) ) 01+11 


4.28 First express scaling and rotation in matrix form 


A D ( cos(0) -sin(O) \ 
and *Hsin(0) cos(0) ) 


we have 


and 


S ^-(o sj 

C D _ ( s * °\ ( cos < fl ) — sin(0) \ _ / S x cos(O) -s x sin (6) \ 
J '»4 — y 0 s y J \ sin(0) cos(0) J ~ \ sin(G) s v cos(0) ) 

_ ( cos(0) — sin(^) \ (s x 0 \ _ / cos (0)s 

0 y sin(fl) cos(0) J \ 0 s y J y sin(0> 


-sin(0)s, 


y ^ 

cos(6)s y J 


In order to satisfy 

^s„,s y ' R() — Rfl ' Ss s ,s y 

we need 

s v sin (if) = sin(fl)s r 

This yields 0 = rm, where n is an integer, or s v = s x , which means that the scaling transformation is unifonn. 


4.29 No, since 


and 


Co :)=cr ;mi “) 

(i ?)-(i 0“(i >) 


4.30 A rotation followed by a simultaneous shearing can be expressed as 

/ cos(0) — sin(0) \ /I a\ _ / cos(0) — b ■ sin(0) a • cos(0) — sin(0) \ 
y sin(fl) cos (6) J yb 1 ) \ sin(fl) 4 b ■ cos(0) a ■ sin(0) 4 cos(0) J 

On the other hand, a simultaneous shearing followed by a rotation can be expressed as 

/I a \ / cos(d) — sin(0) \ _ / cos(0) 4 a ■ sin(d) — sin(fl) 4 a ■ eos(fl) \ 

\ b 1 ) y sin(0) cos(0)/ y b ■ cos(0) 4 sin(0) — b ■ sin(0) 4 cos(0) J 

In order for the two composite transformation matrices to be the same, we need 

cos(fl) — b ■ sin(fi) = cos(O) 4 a ■ sin(ii) 


or 


—b ■ sin(0) = a • sin(0) 

which means d = nn, where n is an integer, or a = —b. 
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4.31 Consider the following sequence of rotate-scale-rotate transformations 

R s R _ ( cos(a) — sin(a) \ /*, 0 \ / cos (/?) -sin(/i) \ 

1 ^ V sin(«) cos(a) / y 0 s y J \ sin(/i) cos (/?) / 

( cos(a) cos (p)s x — sin(a) sin (p)s y — cos(a) sin(/l).v r — sin(a) cos(/J).v v \ 
sin(a) cos (/i)s x + cos(a) sin (/?)s, — sin(a) sin (ff)s x + cos(a) cos (fl)s y I 

By equating the composite transformation matrix on the right-hand side to the matrix for a simultaneous 
shearing transformation 

0 0 

we have four equations that can be solved for parameters a, /?, s x and s y . 


4.32 Consider the following sequence of shearing and scaling transformations: 


/ 1 a\ / s x 0\ /I 0 \_/s x + a-b-Sy a ■ s v 
VO lj VO s y ) [b lj - ^ bs y S y 


By equating the composite transformation matrix on the right to 

_ / cos(fl) — sin(0) \ 
e V sin(A) cos(#) J 

we have 

sin(0) sin(#) 1 

cos(0) ’ cos(0) ' * cos (6) ' 


s y = cos (6). 


4.33 Consider the following sequence of shearing transformations: 

/I a, \ (\ 0 \ /I a 2 V_/l+a 1 /) (1 +a\b)a 2 4- a, \ 

^ 0 1 ) ' V b \)\0 l )~\ b ba i + x / 

By equating the composite transformation matrix on the right to 

( cos(#) —sin(Q) \ 

6 \ sin(ft) cos(# J 

we have 

cost#) — 1 

a, = a? = — — and b = sin(#) 

sin(#) 


4.34 Let CTM„ be the composite transformation matrix representing the concatenation of n basic transfonnation 
matrices. We prove, by mathematical induction on n, that CTM„ is always in the following fonn 

a b c 
d e f 
0 0 1 

n = 1: The basis case is true since CTM, is simply a basic transformation matrix, which fits into the given 
template. 

n = k: Suppose that CTM,. is indeed in the specified fonn. 

( a k b k c k \ 
d k e k f k 

0 0 \) 
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n = k + I : We now show that CTM f+1 is in the same form: 


CTM* +1 = 



• CTM t . = 


' a ■ iif. + b • cl i- a ■ b k + b ■ e k 

d ■ a k + e ■ d k d ■ b k 4- e ■ e k 

0 0 



4.35 Let P l (x'j , y', ) be the transformation of / > ,(x 1 ,„v 1 ) and /'jbi.vi) be the transformation of P 2 (x 2 , v 2 ). Also let 
the composite transformation be expressed as 

a b c 
d e f 
0 0 1 

we have 

x\ = ax, + />V| + c y, = dx | + ey t +/ 
and 

x 2 = ax 2 + hv 2 + c y 2 = dx 2 + ev 2 + / 

Now consider an arbitrary point P(x,y) on the line from P, to P 2 . We want to show that the transfbnned P, 

denoted by P'(x'.y), where x! = ax + by + c and V = dx + ey + /, is on the line between P\ and P 2 . In other 

words, we want to show 

y 2 -y y 2 -y 

x' 2 - x\ x 2 - x' 

which is 

iix 2 + ey 2 +f - - e>’, -/ _dx 2 + ey 2 +f - dx - ey -f 

ax 2 + by 2 + c — ax, — P>v, — c ax 2 + bv 2 + c — ax — by — c 

and is 


Since (x, v) satisfies 


d + e 


V2-V1 
x 2 - x, 


d + e 


-V2-.V 

X-. — X 


x 2 — X, 


a + b 

x 7 — x 


.V2 — .Vj _.V2 —y 

x 2 — X, x 2 — X 


we have established the equality that shows F being on the line between Pj and P 2 


Chapter 5 


5.20 From Prob. 5.1 we need only identify the appropriate parameters. 

(a) The window parameters are ux min = 0, K'X max = 1, H’i> rain = 0, and wv max = 1. The viewport parameters 
are i;x nli „ = 0, rx nl . (X = 199, rv m j„ = 0, and ty milx = 639. Then s x = 199, s v = 639, and 

/ 199 0 0\ 

W = I 0 639 0 

\ 0 0 1 J 

(b) The parameters are the same, but the device y coordinate is now 639 — y (see Prob. 2.8) instead of the y 
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value computed by W in (a) 


/ 199 0 0\ 

W = I 0 -639 639 

\ 0 0 1 / 


5.21 


If s r = s„, then 


vx„ 


vx„ 


Umax Ehnin 
- WVmin 


m:ix V1 3min U max ^3 mul 

M -*raax — "^min l,Jc max ^Utun 


Inverting, we have a w = a, ; . 

A similar argument shows that if the aspect ratios are equal, a„ = a v , the scale factors are equal, s x = s y . 


5.22 


We form N by composing ( 1 ) a translation mapping the center ( 1 , 1 ) to the center (j, i) and (2) a scaling about 
C(/ , j) with uniform scaling factor s = -A so 


N = $ 

’t/io.i/io.c • TV, 


(is 

0 

J 20 

{ 

= 

0 

1 9 

10 20 



[o 

0 \) 

\ 


where v = — jl — 


o -n 


(is ° n 

1 

= 

0 is i 

0 \) 


Vo o i / 


5.23 Let the clipping region be a circle with center at 0(h, k) and radius r. We reduce the number of candidates for 
clipping by assigning region codes as in the Cohen -Sutherland algorithm. To do this, we use the 
circumscribed square with lower left comer at (/; — r, k — r) and upper right corner at (h + r.k + r) to 
preprocess the line segments. However, we now have only two clipping categories- not displayed and 
candidates for clipping. Next, we decide which Ene segments are to be displayed. Since the (nonparametric) 
equation of the circle is (x — h) 2 + (y — k) 2 = r 2 , the quantity K(x,y) = (x — h) 2 -f (y — k) 2 — r 2 detennines 
whether a point P(x, y) is inside, on, or outside the circle. So, if K < 0 for both endpoints P 1 and P 2 of a line 
segment, both points are inside or on the circle and so the line segment is displayed. If K > 0 for either P, or 
P 2 or both, we calculate the intersection(s) of the line segment and the circle. Using parametric representa- 
tions, we find (App. 1, Prob. A1.24) that the intersection parameter is 

-S ± Vs 2 - L 2 C 

Z 2 

where 

L~ = (x 2 — .V] )‘ + (y '2 — Ti )" 

S = (x, - h)(x 2 - Jt,) + (v, - k )(y 2 -Yi) 

C = (x, - h ) 2 + (Y, - kf - r 2 
If 0 5 h < 1, the actual intersection point(s) I(x,v) is (are) 

x = x, + t,(x 2 - x, ) y = y, + t,(y 2 - y, ) 

So, if A' > 0 for either P, or P 2 (or both), we first relabel the endpoints so that P x satisfies K > 0. Next we 
calculate t 2 . The following situations arise: 

1. S 2 - L?C < 0. Then t 2 is undefined and no intersection takes place. The line segment is not displayed. 

2. S 2 -L 2 C= 0. There is exactly one intersection. If > 1 or < 0, the intersection point is on the 
extended line, and so there is no actual intersection. The line is not displayed. If 0 < /, < I, P\P 2 is 
tangent to the circle at point /, so only I is displayed. 

3. S 2 — L 2 C > 0. Then there are two values for t h and tj . If 0 < tj , tj < 1, the line segment I~I + is 

displayed and the segments (assuming f/ > tj) P X I~ and / + / J 2 are clipped. If only one value, say, //, 
satisfies 0 < < 1, there is one actual intersection and one apparent intersection. Since in this case P 2 is 

either point I + or inside the circle, P, I~ is clipped and I^P 2 is displayed. If tf, tf > 1 or if. If < 0, then 
P\P 2 is not displayed. 
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5.24 Following the logic of the Sutherland-Hodgman algorithm as described in Prob. 5.14, we first clip the 
“polygon” P, P 2 against edge AB of the window: 

1 . AB. We first detennine which side of AB the points P, and P 2 lie. Calculating the quantity (see Prob. 
5.13), we have 

C = (x 2 - x, )(v - Vi ) - tv 2 - .V, )(x - -v, ) 

With point A = (x, , v, ) and point B = (x 2 ,y 2 ), we find C = 8 for point P, and C = 2 for point P 2 . So 
both points lie on the left of AB. Consequently, the algorithm will output both P, and P 2 . 

2. BC. Setting point B = (x, , v, ) and C = (Xj. Vt), we calculate C = 13 for point P, and C = —3 for point 
P 2 . Thus P, is to the left of BC and P 2 is to right of BC. We now find the intersection point /, of P, P 2 
with the extended line BC. From Prob. A2.7 in App. 2, we have /, = (4 jg, 3|). Following the algorithm, 
points P, and /, are passed on to be clipped. 

3. CD. Proceedings as before, we find that C = 2 for point P, and C = 6| for point /, . So both points lie to 
the left of CD and consequently are passed on. 

4. DA. Setting point D = (x,,yi) and/1 = (x 2 . y 2 ), we find C = — 3 forP, and C = 10 for/,. Then P, jies to 
the right of DA and /, to the left. The intersection point of P,/, with the extended edge DA is 
1 2 = (^ , 2|). The clipped line is the segment /,/ 2 . 


Chapter 6 


6.9 From Prob. 6.2, we identify the parameters 


Then 


V = fll + bJ + cK = I + J + K 
|V| = s/a 2 + ft 2 + c 2 = yU + U + P = s/3 

1 = s/b 2 + c 2 = v 7 1 2 + 1 2 = s /2 


Ay — 


(s /2 

-1 

-l 

o' 

s /3 

s/ 2 . vX 

vXvX 

0 

1 

7! 

-1 

7! 

0 

1 

i 

i 

0 

7! 

7! 

7! 

V 0 

0 

0 

1/ 


6.10 From Prob. 6.5, A v N = /1 N ' Ay. We find Ay first. From Prob. 6.2 we identify the parameters |V| = \/3. 
A = s/2, a = 1, 6 = 1, c = 1. So 


^v 


(s /2 

s /3 

0 

1 

vX 

V 0 


-1 

vXy! 

i 


vX 


vX 

0 


-1 

vXvX 

-1 

n 

1 

vf 

0 


o 

o 

i> 
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For^ N \ we have |N| = v/ 6 , A = s/2, a = 2, b = — 1, and c = — 1. So 


Note that V' =^ v ,n • 
image of V under A V N 


/ n/2 

s/6 
2 


AZ' = 


-1 


2 

76 

-1 


s/2.76 s/2 s/6 
2 1 -1 
s/2.76 72 76 
V o oo 


0 

0 

iy 


r- s/2 72 , s/2 

Ay ■ V = 721 -\ J - K so that V = — 
2 2 2 

is not the vector N, but a vector that has the direction of N. 


--a7 


N. In other words, the 


6.11 This follows from comparing the matrices Ay 1 with Ay from Prob. 6.2. 

6.12 If we place vectors V and N at the origin, then from App. 2, V x N is perpendicular to both V and N. If 6 is 
the angle between Y T and N, then a rotation of 6° about the axis L whose direction is that of V x N and which 
passes through the origin will align V with N. So /1 V N = L . 

6.13 As in the two-dimensional case in Chap. 4, we reduce the problem of scaling with respect to an arbitrary point 
P 0 to scaling with respect to the origin by translating P 0 to the origin, performing the scaling about the origin 
and then translating back to P 0 . So 

S s . , u — TZp ■ S, s , ■ T_ P 

s x> Sy,s r ,r o ■* o s x ,Sy,s. i o 


Chapter 7 


7.16 From Prob. 7.5, we need to evaluate the parameters ( a.b,c ), («], n 2 , «■<)> (</. d n , d ] ) to construct the 
transformation. From the equations in Prob. 7.6, part (b) [denoting the principal vanishing points as 
/- > i(.r 1 ,v'i,e 1 ), P 2 (x 2 .y 2 . z 2 ), and Pji-x^.yy. z 3 )], we find a = x 2 (or ,v 3 ), b = V] (orya), and c = c, (or z 2 ). Also 

d d d 

«1 = »2 = 7 «3 = 

x x - a y 2 - b *3 — c 

To find </, d 0 , and d } , we note (App. 2, Prob. A2.13) that the distance D from the point C{a, b. c ) to the plane 
can be expressed as D = |rf|/|N|, where |N| is the magnitude of N. Since we need only find the direction of 
the nonnal N, we can assume |N| = 1 . Then d = ±D. The choice ±, based on the definition of d in Prob. 7.5, 
is dependent on the direction of the normal vector N, the reference point R f) , and the center of projection C. 
Since these are not all specified, we are free to choose, and we shall choose the 4- sign, that is, d — D. Finally 

d { = rifQ + n 2 b + n 2 c and d u = d + (/, 


7.17 We use the coordinate matrix V constructed in Prob. 7.1 to represent the unit cube. 

(a) From Problem 7.14, the isometric projection matrix Par is applied to the coordinate matrix V: 


I 0 


Par • V = 


0 


i: 


/23/21/2 /2 1/2 3/2 ^ [2 

V 3 2 V 3 2 V 3 V 3 2 V 3 2 V 3 2 V 3 




0 

0 

1 


72 72 

2 2 
0 0 


— s/2 -72 

2 2 

0 0 

1 1 


0 

0 

I / 
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This is the matrix of the projected vertices, which can now be read off (see also Fig. S-4). 


A = ( 0 . 0 . 0 ) 




(/>) To produce a dimetric drawing, we proceed, as in part (a), by using the dimetric transformation Par from 
Prob. 7.15. Choosing the projection ratio of j : 1 : 1 (i.e., 1 = \), we have 


/s/2 

VT4 

714 

3 

~ 6 ~ 

6 

0 

72 

-72 

2 

2 

0 

0 

0 

^ 0 

0 

0 


0 




0 


: 


The projected image coordinates are found by multiplying the matrices Par and V: 



Fig. S-4 


Fig. S-5 
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The image coordinates are (see Fig. S-5) 


A' =(0,0. 0) 


B' = 
C' = 

D' = 


(?■••) 

^ 2sj2+sf\A ^2 


E = 

(2\fl + \/\A -s/2 
G= { 6 ' — 


7.18 Since the planes we seek are to be located at the origin, we need only find the normal vectors of these planes 
so that orthographic projections onto these planes produce isometric projections. In Prob. 7.14, we rotated the 
xyz triad first about the x axis and then about the y axis to produce an isometric projection onto the xy plane. 
Equivalently, we could have tilted the xy plane (and its nonnal vector K) to a new position, thus yielding a new 
view plane which produces an isometric projection with respect to the (unrotated) xyz triad. Using this 
approach to find all possible view planes, we shall use the equations from Prob. 7.14 to find the appropriate 
rotation angles. From the equations 

sin 2 6 X —cos 2 0 X = 0 cos 2 Q y = j [sin 2 9 y + 1 ] 

we find the solutions 

^2 l \ 

sin 0 X = ± , cos 6 X = ± -y- and sin 6 y = ±. cos 0 y = 

From Chap. 6, Prob. 6.1, part (/>), the matrix that produces the tilting is 

( cos 6 V 0 sin 0 V \ 

sin 0 X sin 0 y cos 0 X —sin F v cos 0 y I 

—cos 6 X sin 0 V sin 0 r cos 0 X cos Q y ) 

Applying this to the vector K = (0. 0, 1), we find the components of the tilted vector to be 
x -- sin 6 V y = —sin 6 X cos 0 y z = cos 6 X cos 0 V 
Substituting the values foimd above, we have eight candidates for the normal vector N = xl + yJ + zK, where 

x = ±J\ y=±Jl z=±f- 

However, both N and — N define normals to the same 
view planes (through the origin) with normals 

N, =J^(I + J + K) 

N, = A (-1 4- J + K) 


plane. So we finally have four solutions. These are the 

N, =J^(1-J + K) 

N 4 =J^(I + J-K) 



Chapter 8 

8.11 Referring to Fig. 8-7 (and Prob. 8.4) we call CR the vector having the direction of the line from the center of 
projection C to the window comer R. Similarly, we call CL the vector to the window comer L. Then: 

1 . Top plane — determined by the vectors l p and CR and the reference point R f 

2. Bottom plane — determined by the vectors I p and CL and the reference point L f 

3. Right side plane — determined by the vectors J (| and CR and the reference point R f 
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4. Left side plane — determined by the vectors J q and CL and the reference point L r 

5. Front (near) plane — detennined by the (view plane) normal vector N and the reference point P, 

6. Back (far) plane — detennined by the normal vector IN and the reference point P b 


8.12 Suppose that the plane passes through point R 0 (x 0 , y 0 , z 0 ) and has a normal vector N = n, I 4- n 2 J + n 3 K. Let 

the points P^ (X] , y t , z, ) and P 2 (x 2 .y 2 , - 2 ) detennine a line segment. From App. 2, the equation of the plane is 

"i (x ~ x () ) 4- n 2 (y - y 0 ) 4- n } (z - z 0 ) = 0 

and the parametric equation of the line is 

x=x, +(x 2 -x ] )t y =y l +{y 2 -y i )t z = z, 4- (z 2 - z,)t 

Substituting these equations into the equations of the plane, we obtain 

"it-L + (x 2 - x, )/ - x 0 ] + n 2 [v, + (y 2 - y, )t - y 0 ] 4- n 3 [z, 4- (z 2 - z, )t - z 0 ] = 0 

Solving this for / yields the parameter value t, at the time of intersection: 

( ^ »\(x\ ~x n ) + n 2 (v | -y 0 ) + HjU, - z 0 ) 

1 nfx 2 - x, ) + n 2 (y 2 - v, ) + n } (z 2 - z,) 

We can rewrite this using vector notation as 

t N - Wi 

‘ N P,P 2 

The intersection points 1(X/, Vpzf) can be found from the parametric equations of the line: 

X/ = x, + (x 2 - x, )t, y, = y, + (v 2 - y, )t, = Zj + (z 2 - z, )t, 

If 0 < t/ < 1 , the intersection point / is on the line segment from P ] to P 2 , if not, the intersection point is on 
the extended line. 


Chapter 9 


9.9 Referring to Fig. 7-12 in Chap. 7, we define a vertex list as 

V = { ABCDEFGH } 

and an explicit edge list is: 

E = [AB. AD, AF, BC. BG, CD. CH. DE. EF. EH. FG. GH ) 

The cube can be drawn by drawing the edges in list E. Referring to Prob. 9.2, we note that a typical polygon, 
say, P\ , can be represented in terms of its edges as 

P ] = \ AB.AD.BC.CD } 

The polygons sharing a specific edge can be identified by extending the edge’s representation to include 
pointers to those polygons. For example: 

AB^P„P a AD-r P u P 2 


9.10 


The knot set can be represented as t 0 , t 0 + L, t 0 + 2L, On the interval t. = t„ + (/ — \ )L to 

t i+ 2 = t 0 + (i + 1)1, we have 


B iA (x) = 


x ~ [ip + (/ - 1 )L] 

Uo + iL) — [fo 4- (/ — 1)Z] 


B, n(x) 4- 


[tp 4- (/ 4- 1 )L] — x 
[to + 0 + I )k\ ~ (to + d ) 




On the interval that is, / 0 4- (/— 1 )L < x< 4- iL, we have fi, 0 (x) = 1 and 5, +l 0 (x) = 0. On the 

interval , /, +2 ], that is, ? 0 4- iL < x < t 0 4- (i 4- 1 )L, we have B, 0 (x) = 0 and B, +1 0 (x) = 1 . Elsewhere both 
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Sf.o(-v) = 0 and S, +1 . 0 (x) = 0. So 


B u (x) 


x - t u + (/-! )/- 
L 

to + (/’ + I )L — x 

z 

o 


on 

on 


t 0 + (i - \)L < x < t 0 + iL 

/q + iL < x < tQ + (/ + 1 )L 
elsewhere 


9.11 


From the definition of a B-spline, B, 3 (x) is nonzero only if /, <x< l, +4 . In terms of the given knot set, this 
equates to / < x < i + 4. With x = 5.5, B, 3 (5.5) is nonzero for i = 2, 3, 4, and 5. Now 


S, ,3(5.5) 


(5.5) 


0 + 3) • 


—Bi. 2 ( 5 - 5 ) + 


(i + 4) -(5.5) 
0 + 4) — (t + 1 ) 


B m . 2(5-5) 


3, .2(5.5) = -3 i2 (5.5) + '— pl 5 . +] ,(5.5) 

Starting with i = 2, 

3 2l ,(5.5) = Y 5 2 2 (5.5) + 3,, 2 (5.5) 

Now 5 2 2 ( x) is nonzero if 2 < x < 5. Thus 3 2 2 (5.5) = 0, and so S 2 ,(5.5) = (0.5/3) 5, 2 (5.5). Because 
S 12 (a') is nonzero for 3 < x < 6, we find that 

(5.5) -3 6- (5.5) 

By 2(5.5) = g _ 3 *3,1 (5.5) + 6 _ 4 ' * 4,1 (5-5) 

Now S 3 | (a) is nonzero if 3 < x < 5. So S,,(5.5) = 0. Now B 4 , (x) is nonzero if 4 < x < 6. Thus 

^.2(5.5)=^S 4i1 (5.5) 

Now 

S 4 ,i (5-5) = {5 S 5) _ ~ 4 S 4 .o( 5-5) + 6 ~Yg 5) ^.o(5-5) 

Since B 40 (x) is nonzero if 4 < x < 5, we find that S 4 0 (5.5) = 0. So 

S 4i1 (5.5) = ^S 50 (5.5) 

However, 3 50 (.v)=l if 5 <x£6. So 3 50 (5.5)= 1, 3 4 ,(5.5) = 0.5(1) = 0.5, and 3 3 ,(5.5) = 
(0.5/2)(0.5) = 0.25/2, and finally 

« u {5.5, = y (5|5) = ^ = 0.0208333 

The computations for B, ,(5.5), B 4 ,(5.5), and S 5 ,(5.5) are carried out in the same way. 


Chapter 10 


10.25 The properties of parallel projection can be used to simplify calculations if the objects to be projected are 
transformed into “new objects” whose parallel projection results in the same image as the perspective 
projection of the original object. 

10.26 Since the Z-buffer algorithm changes colors at a pixel only if Z(x, v) < Z buf (jc, v), the first polygon written will 
determine the color of the pixel (see Prob. 10.7). 

10.27 A priority flag could be assigned to break the tie resulting in applying the Z-bufFer algorithm. 
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10.28 A system that distinguishes 2 2 ' 1 depth values would require three bytes of memory to represent each z value. 
Thus 3 x 1024 x 768 = 2304 K of memory would be needed. 

10.29 The scan-line method can take advantage of (o) scan-line coherence, (b) edge coherence, (c) area coherence, 
and ( d ) spatial coherence. 


10.30 Scan-line coherence is based on the assumption that if a pixel belongs to the scan-converted image of an 
object, the pixels next to it will (most likely) also belong to this object. 

10.31 Since this figure is a polygon, we need only find the maximum and minimum coordinate values of the vertices 
A, B, and C. Then 


min = ® 

-*Tiiax “ 

nun 0 

.Vmax ~ 

'min 1 

2 =2 
“max 


The bounding box is shown in Fig. 10-26. 

10.32 Horizontal line segments (v min = .v max ) lie on only one scan line; they are automatically displayed when 
nonhorizontal edges are used in the scan-conversion process. 


10.33 We search the 3 coordinates of the vertices of the polygon for the largest value, z max . The depth of the polygon 
is then z m9V . 


10.34 Area coherence is exploited by classifying polygons with respect to a given screen area as either a surrounding 
polygon, an intersecting polygon, a contained polygon, or a disjoint polygon. The key tact is that a polygon is 
not visible if it is in back of a surrounding polygon. 

10.35 When using a hidden-surface algorithm to eliminate hidden lines, we set the fill color of the polygons, 
determined by the lines, to the background color. 


Chapter 11 

11.25 Let IV, H, D, and P be defined in the same way as in Prob. 11.3. We can calculate saturation using 
DxP/(W xH+Dx P). 

11.26 The color-sensitive cones in our eyes do not respond well to low intensity light. On the other hand, the rods 
that are sensitive to low intensity light are color blind. 

11.27 Yes. We can use 


11.28 The Y in CMY means yellow, whereas the Y in YIQ represents luminance. 

11.29 


N = 


x - x„ 


. , v - v c , : 
I 4 — — J 4- - 


R 


— K 
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11.30 The parametric representation for the target area is 


X = e 

v = 1 .2 cos(<p) 
2 = l.2sin(<p) 


0 < 0 < 3.0 
0 <(p< n/2 


Note the relationship between the comer points: 


11 = 0. 

© 

II 

CD 

t 

o 

II 

</> = n/2 

u= 1, 

w = 0->- 0 = 3, 

II 

ro 

u = 0, 

© 

II 

CD 

t 

<N 

II 

<p = 0 

u= 1, 

w = 2 0 = 3, 

q> = 0 

Substitute these into 0 = Au + B and <p 

= Cw + D, we get 


A= 3, 

B = 0. C = -jt/4. 

D = 

Hence the mapping functions are 



e = 

= 3 u and w = — - 

2 

n 

4 W 


The inverse mapping functions are 



and 


w — 


ft/2 ~ <P 
n/4 


Chapter 12 


12.27 Let /| be the time required for the light ray to travel from A to P, and t 2 be the time required tor the light ray to 
travel from P to B (see Fig. 12-21). We have 


\Jx 2 + \/(*B •*) + ,'fi 

0 = „ h = 

C\ Cj 

To locate P (i.e., to determine x) such that the total travel time t = fj + t 2 is minimal, we find 

d/ x x B — x 


or 


Notice that 0 < x < x B and 


* C '^ 2+ >’t c 2y j(x B -xf +yjf 

df sin(a) sin(/i) 
dx C] c 2 

(a- (!)>• 


These suggest that I reaches a minimum when 

sin(a) sin(jS) 
c, c 2 


= 0 


sin(a) cj 
sin(/i) c 2 


Hence 
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12.28 Yes, these two procedures produce identical results when invoked with the same call. However, for a given 
depth value the procedure in this question involves one more level of recursion than the one in the text. Hence 
the procedure in Sect. 12.2 has better execution efficiency. 

12.29 Find two points on the line 

x = 0 — ► y = 0 — 6 = — 6 
x = 1 — >• v = 2 — 6 = —4 

Use (0, —6) to be the starting point, we have 

s = — 6J 

d = (1 - 0)1 + (-4 - (— 6))J = 1 + 2J 
and the parametric vector equation for the line is 

L(/) = s + rd where — oo < t < -foo 

12.30 Intersection of a ray with the yz plane can be detennined by solving the following for I: 

s -F /d = y,J 4- z,K 

With s = x s I +y s J + z s K and d = x d I + y d J + z d K, we have 

[ U + tx A = 0 
| v» + (Vd = V, 
l -S 4" t-A ~i 

When v d = 0, the ray is parallel to the plane (no intersection). When x s = 0, the ray originates from the plane 
(no intersection). Otherwise, we calculate t using the first equation 

t _ _ x s 

x a 

If / < 0, the negative extension of the ray intersects the plane. On the other hand, if / > 0, the ray itself 
intersects the plane and the coordinates y, and z, of the intersection point can be calculated from the second 
and third equations. 

12.31 Let s = x s I + y„.) + ~ S K and d = x d I +y d J + z d K. Substitute ,r s -+- tx A and y s + ty A for ,r and y, respectively 

(x s + tx A Y + (v s + n d )' — = 0 

Expand and regroup tenns 

(A'd +> d )/ 2 + 2(.v s .r d 4->vv d )f + (a* +>») - R 2 = 0 


or 

At 2 + 2Bt + C = 0 


where 


A = x A +y d B = x s x A d-yj-'d C = x, +)% — R 2 

When A = 0, the ray is parallel to the z axis and does not intersect the cylinder (the entire ray is on the cylinder 
if the starting point is on the cylinder). Otherwise, the solution for the quadratic equation is 

-b±Vb 2 -ac 


B 2 - AC 


< 0 
= 0 
> 0 


no intersection 

ray (or its negative extension) touching cylinder 
two (possible) intersection points 



ANSWERS TO SUPPLEMENTARY PROBLEMS 


341 


The last case ( 5 1 — AC > 0) produces two t values: t\ and h. If /| <0 and t 2 < 0, the negative extension of 
the ray intersects the cylinder (no intersection by the ray). If one of the two values is 0, the ray starts from a 
point on the cylinder and intersects the cylinder only if the other value is positive. If and t 2 differ in signs, 
the ray originates from inside the cylinder and intersects the cylinder once. If both values are positive, the ray 
intersects the cylinder twice (first enters and then exits), and the smaller value corresponds to the intersection 
point that is closer to the starting point of the ray. 

12.32 Substitute 5 in At 2 4 - Bt + C = 0 with D we have 

At 1 + Dt + C = 0 

and the solution is 

-D±s/D 2 - A AC 
‘ ~ 2A 

Now let D = 25, the above equation becomes 

At 2 + 2Bt + C = 0 

and the solution is 

-25 ± y(25) 2 - 4 AC —2B±2\/B 2 — AC -B±s/B 2 -AC 

' ~ 2A “ 2 A - A 




4 -connected, 42 
8-connected. 42 

Adaptive depth control, 259 
Adaptive supersampling, 26 1 
Aliasing, 2, 47 
Ambient Light, 235 
Animation: 
lookup table, 101 
panning, 100 
zooming, 100 
Anti-aliasing, 2, 47, 260 
Anticyclie spline condition, 181 
Approximation, 181 
Bezier— B-spline, 182 
Bezier-Bernstein, 181 
Area coherence, 205 
Area sampling, 49 
Aspect ratio, 6 
Asymmetry, 48 
Axis, 273, 298 

Axonomctric projections, 132 

B-splines, 178 
interpolation, 180 
Back-face removal, 200 
Basis (blending) functions, 177 
B-splines, 178 
Bernstein polynomials, 179 
I lennite cubic polynomials, 1 78 
Lagrange polynomials, 177 
Bernstein polynomials, 179 
Bezier-Bernstein approximation, 181 
Bezier-Bernstein surfaces, 1 84 
Bilevel image, 8 
Bilinear interpolation, 237, 246 
Bitmap font, 45 

Blending functions (see Basis functions) 
Boundary-fill, 42, 43 
Bounding box, 201 
Bounding volume, 259 
Bounding volume extension, 259 
Bresenham's circle algorithm, 3 1 
Bresenham’s line algorithm, 27 
Brightness, 230 
Bump mapping, 240 

C curve, 5 1 

Cabinet projection, 133 


Canonical clipping, 155 
Cartesian coordinates: 

correlation with homogeneous coordinates, 308 
three-dimensional, 298 
two-dimensional, 273 
Cathode ray tube (CRT), 9 
electrostatic deflection, 10 
magnetic deflection, 10 
Cavalier projection, 133 
Center of projection, 129 
Character size, 45 
pica, 45 
point, 45 

Chromaticity coordinates, 233 
CIE chromaticity diagram, 232 
CIE XYZ color model. 232 
Circle: 

Bresenham’s circle algorithm, 3 1 
eight-way symmetry, 29 
midpoint circle algorithm, 33 
Circle, equation of: 
non-parametric, 30 
parametric, 31 
Clamped spline, 180 
Clipping: 

line, 91, 94, 170, 172 
point, 91 
polygon, 96 
three-dimensional, 155 
two-dimensional, 91 
Clipping categories, 91 
Clipping planes, 155 
back (far), 155 
front (near), 155 
Clipping window, 90 
CMY color model, 7 

Cohen-Sutherland line-clipping algorithm, 91, 170 
Coherence, 204 
area, 205 
edge, 204 
scan-line, 204 
spatial, 205 

Color gamut mapping, 233 
Color map, 9 

Color matching functions, 232 
Color model: 
additive, 7 
CIE XYZ, 232 
CMY, 7 
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Color model (Coni.): 

NTSC YIQ, 234 
RGB, 7 
subtractive, 7 
Complex numbers, 16 
Composing functions, 284 
Composite transformation matrix (CTM), 75, 118 
Compression, 15 
run-length encoding (RLE), 15 
Computer-human interaction, 4 
Concave polygon, 96 
Connected: 

4-connected, 42 
8-connected, 42 
Constant shading, 237 
Control electrode, 10 
Control points, 181 
Convex polygon, 96 
Coons surfaces, 185 
Coordinate system: 

Cartesian coordinates, 273, 299 
homogeneous, 307 
left-handed, 299 
polar coordinates, 277 
right-handed, 299 
three-dimensional, 298 
two-dimensional, 273 
Coordinate transformation, 68, 71, 117 
inverse, 73 

matrix description, 74 
reflection (see Reflection) 
rotation (see Rotation) 
scaling (see Scaling) 
translation (see Translation) 
viewing transfonnation (see Viewing 
transformation) 

Cross product, 304 
Cross-section curve, 1 86 
CRT (see Cathode ray tube) 

CTM (see Composite transfonnation matrix) 
Curve design, 176 
Curve segment, 177 
Curved surface, 176 
Curves, equation of, 300 
explicit fonn, 300 
implicit form, 300 
non-parametric, 300 
parametric, 300 
Cyclic spline condition, 1 8 1 
Cylinder surface, equation of, 302 

Deflection: 
electrostatic, 10 
magnetic, 10 

Depth buffer (see Z-buffer) 

Depth buffer algorithm (see Z-buffer algorithm) 


Depth comparison, 197 
Depth value, 199 
Device coordinate system, 89 
Diffuse reflection, 235 
Digital image, 6 
Dimetric projection, 132 
Direct coding, 8 
Direction of projection, 1 32 
Display monitor, 9 

Distance from a point to a plane, 306, 314 
Distance formula, 273, 300 
Distributed ray-tracing, 261 
Dithering, 13 
dither matrix, 13 
dither pattern, 12 
Dot product, 280, 304 
Double buffering, 101 
Dropout, 48 


Edge coherence, 204 
Edge list, 44 
Electron gun, 10 
Ellipse: 

four-way symmetry, 35 
midpoint ellipse algorithm, 37 
Ellipse, equation of: 
non-parametric, 35 
parametric, 36 
Environment mapping, 261 
Error diffusion: 

Floyd-Steinberg, 13 
Excitation purity, 23 1 
Explicit vertex list, 176 
Extent, 201 

Filtering, 49 
lowpass filtering, 50 
post-filtering, 49 
pre-filtering, 49 
Flicker, 10 
Flood-fill, 42, 43 

Floyd-Steinberg error diffusion, 1 3 

Fluorescence, 10 
Focusing electrode, 1 0 
Font, 45 
bitmap, 45 
outline, 46 
TrueType, 49 
Framebuffer, 10 

Function (mapping, operator, transfonnation), 283 
composition of, 284 
domain, 283 
graph of, 283 
inverse, 284 
matrix, 284 
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Function (mapping, operator, transformation) (Com.) 
range, 283 

(See also Composing functions) 

Geometric representation, 1 74 
curved surface, 176 
curved surface patch, 1 84 
line, 174 
point, 174 
polygon, 174 
polygon mesh, 175 
polyhedron, 176 
polyline, 174 
quadric surface, 186 
wireframe, 175 

Geometric transformation, 68, 114 
matrix description, 74 
Gouraud shading, 237 
Graphics pipeline: 
three-dimensional, 3, 159 
two-dimensional, 2, 99 
Gray axis, 7 
Gray-scale image, 8 
Guiding nets, 1 84 
Bezier — B-spline surfaces, 185 
Bezier-Bernstein surfaces, 1 84 

Halftone, 12 

Halftone approximation, 12 
Halftoning, 11 

Hermitian cubic polynomial interpolation, 179 
Hidden surfaces, 197, 251 
Homogeneous coordinates, 307 
ideal point, 308 
line, 307 
point, 75, 307 
Hue, 230 

Ideal point, 308 
llluminant C, 233 
Illumination model, 4, 234, 251 
local, 4, 234 
global, 4, 234, 251 
Image, 6 

Image, representation of: 
direct coding, 8 
lookup table, 9 
Image files, 14 
Image processing, 4 
Image space, 1 

Inside-outside test of view volumes, 158 
Inside test for polygons, 96 
Instance, 76 

Instance transformation, 76 
Instancing, 76 
Interlaced, 14 
Interlacing, 10 


Interpolating surface patches, 1 85 
Coons surface, 185 
lofted surface, 185 
Interpolation, 179 
Hermitian cubic, 179 
Lagrange polynomial, 179 
spline, 180 

Interpolative shading methods (see Shading methods) 

Intersection, computation of, 256 

Inverse function, 284 

Inverse matrix, 282 

Isometric projection, 1 32 

Julia sets, 18 

Knots (nodes), 1 77 
Koch curve, 52, 65 

Lagrange polynomial, 177 
interpolation, 179 
Law of Cosines, 274 
Left-right test, 96 

Liang-Barsky line-clipping algorithm, 94, 172 
Light, 229 
Line, equation of: 
parametric, 94 
slope-intercept, 26 
Line, scan-conversion of: 

Bresenham's line algorithm, 27 
digital differential algorithm (DDA), 27 
Line clipping: 

Cohen-Sutherland algorithm, 91, 170 
midpoint subdivision, 93 
Liang-Barsky algorithm, 94, 1 72 
Line segment, 26 
Linear blending, 186, 195 
Lofted surface, 185 
Lofting, 185 
Lookup table, 9 
Lowpass filtering, 50 
Luminance, 230 

Mandelbrot set, 16 
Mathematical surfaces, 209 
Matrices, 281 
addition, 281 

concatenation (see Matrix concatenation) 
homogeneous fonn, 308 
identity, 282 
inverse, 282 

multiplication (see Matrix multiplication) 
scalar multiplication, 281 
transpose, 282 

Matrix concatenation, 75, 281 
(See also Matrix multiplication) 

Matrix multiplication, 281 
composition of matrix functions as equivalent to, 
284 
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Midpoint circle algorithm, 33 
Midpoint ellipse algorithm, 37 
Midpoint subdivision clipping algorithm, 93 
Modeling, 174 
additive, 176 
solid, 176 
subtractive, 176 
Monitor, 9 
Motion blur, 262 

Natural coordinate vector, 279, 303 
Natural spline, 180 
Nested instances, 76 
Normal vector, 235, 238, 306 
Normalization transformation, 90 
Normalized device coordinate system, 89 
Normalized perspective to parallel transform, 1 98, 
213 

Nonnalizing transformation for canonical view 
volumes, 157 

NTSC YIQ color model, 234 
Object space, 1 

Oblique (parallel) projection, 129, 132 
Orientation (right- and left-handed), 299 
Orthographic (parallel) projection, 129, 132 
Outline font. 46 
Overstrike, 58 

Painter’s algorithm, 200 
Panning (see Animation) 

Parallel projection, 132 
axonometric, 132 
cabinet, 133 
cavalier, 133 
dimetric, 132 
isometric, 132 
oblique, 132 
orthographic, 132 
trimetric, 132 
Parallel vectors, 278 
Parametric equation, 300 
circle, 31 
curve, 300 
line, 94, 301 
surface, 301 
Persistence, 10 

Perspective to parallel transform, 198 
Perspective anomalies, 130 
perspective foreshortening, 130 
topological distortion, 131 
vanishing points, 129, 131 
view confusion, 131 
Perspective foreshortening, 129, 130 
Perspective projection, 129 
one principal vanishing point, 141 
three principal vanishing points, 142 
two principal vanishing points, 141 


Phong: 
formula, 236 
model, 234 
shading, 238 
Phosphor, 9 
Phosphorescence, 10 
Pica, 45 

Picket fence problem, 48 
Pinhole camera, 251 
Pitch, 1 1 
Pixel, 6 
coordinates, 6 
Pixel phasing, 51 
Planar polygon, 175 
Planes, equation oft 302 
Point clipping, 91 
Point light, 234 
Polar coordinates, 277 
Polygon, 96, 174 
orientation, 96 
Polygon clipping, 96 
Sutherland-Hodgman algorithm, 96 
Weiler-Atherton algorithm, 97 
Polygon mesh, 1 75 
Polyhedron, 176 
faces of, 176 

hidden surfaces of, 200, 225 
Polyline, 174 
Polynomial, 177 
piecewise, 177 

Polynomial basis function, 177 
Positively oriented polygon, 96 
Post-filtering, 49 
Pre-filtering, 49 
Primary ray, 252 
Principal vanishing point, 129 
Printer, 1 1 

Projected texture, 239 
Projection, 128 
center of, 1 29 
classification of, 129 
direction of, 132 

(See also Parallel projection; Perspective projection) 
Projection of a vector onto a plane, 3 1 6 
Projection plane, 128 
Projective plane, 307 
Projector, 128 

Quadric surfaces, 302 
equations of, 186 

Raster, 6 
Rasterization, 25 
Ray, 251 
primary, 252 
reflected, 252 
secondary, 252 
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Ray ( Cont .): 
shadow, 252 
transmitted, 252 

Ray, vector representation of, 253 
Ray tracing, 251 
adaptive depth control, 259 
bounding volume extension, 259 
hierarchy of bounding volumes, 259 
spatial coherence/spatial subdivision, 260 
Ray-surface intersection, 256 
arbitrary plane, 256 
coordinate system plane, 256 
cylinder, 272 

elliptic paraboloid, 268, 269 
general implicit surface, 258 
region, 265 
sphere, 257 
Receptor cells, 231 
cones, 23 1 
rods, 23 1 
Reflected ray, 252 
Reflection, 235 
blurry, 262 
diffuse, 235 
specular, 235 

three-dimensional, 124, 125, 244 
two-dimensional, 70, 81, 82 
Reflection coefficients, 235 
Refresh, 10 
interlacing, 10 
refreshing rate, 10 
Region code, 92 
Region filling, 42 
boundary-fill, 43 
flood-fill, 43 
scan-line, 44 
Resolution, 6 
Retrace, 10 
horizontal, 10 
vertical, 10 
RGB color model, 7 
Rotation, 69, 115 
three-dimensional, 115 
two-dimensional, 69, 72, 78, 79 
Run-length encoding (RLE), 15 

Saturation, 231 
Scaling, 70, 115 
homogeneous, 70 
magnification, 70, 115 
reduction, 70, 115 
three-dimensional, 1 1 5 
two-dimensional, 70, 72, 75 
uniform, 70 
Scan conversion, 25 
characters, 45 
circles, 29 


Scan conversion (Coni.): 
ellipses, 35 

incremental methods, 27 
lines, 26 
points, 25 
polygons, 42 
rectangles, 41 
sectors and arcs, 40 
Scan-line algorithms: 
for hidden surface removal, 203 
for region filling, 44 
Scan-line coherence, 204 
Secondary ray, 252 
Shading methods, 236 
constant, 237 
Gouraud, 237 
Phong, 238 
Shadow mask, 1 1 
Shadow ray, 252 
Shearing transformation, 83 
Side of a plane, 302, 312 
Sierpinski gasket, 52, 66 
Snell’s law, 252, 271 
Soft shadow, 262 
Solid modeling, 1 76 
Solid texture, 240 
Spatial coherence, 205, 260 
Spatial subdivision, 260 
Spectral energy distribution function, 229 
Spectral reproduction, 229 
Specular reflection, 235 
Splines, 180 
anticyclic, 181 
clamped, 180 
cubic, 180 
cyclic, 181 
natural, 180 
quadratic, 180 

Standard perspective projection, 130 
Standard white D 65 , 243 
Stochastic Supersampling, 261 
Subdivision algorithms, 93, 207 
Supersampling, 50, 260 
adaptive, 261 
stochastic, 261 
Surface patch, 185 

(See also Interpolating surface patches) 

Surfaces, equations of, 301 
cylinder, 302 
explicit form, 301 
implicit form, 302 
non-parametric, 301 
parametric, 302 

planes, 302, 306, 312, 313, 314 
quadric surfaces, 1 86, 302 
sphere, 257, 309 

Sutherland-IIodgman polygon-clipping algorithm, 96 
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Terrain generation, 1 89 
Texture, 4, 239 
mapping, 240, 248, 249, 250 
projected, 239 
solid, 240 
Texture map, 239 

Three-dimensional transformations, 114 
Three-dimensional viewing, 151 
Tilting transformation, 1 1 8 
Transformation, 68, 114 
composite, 73, 117 
coordinate, 68, 117 
geometric, 68, 114 
instance, 76, 118 
normalization, 90 
viewing, 90, 158 
workstation, 90 

Transformation matrix, 74, 115 
Transforming curves and surfaces, 186 
Translation, 69, 72, 114 
matrix form, 75, 115 
three-dimensional, 114 
two-dimensional, 69, 72 
Translation vector, 69, 72, 114 
Translucency, 262 
Transmitted ray, 252 
Trichromatic generalization theory, 23 1 
Trimetric projection, 132 
True color representation, 8 
TrueType font, 49 

Two-dimensional transformations, 68 
Typeface, 45 

Unequal brightness, 47 
Unit vector, 304 
Up vector, 152 
User interface, 4 

Vanishing points, 129, 131 
Vector components, 279, 303 
Vector equation of: 
line, 305 
plane, 306 
ray, 253 


Vector equation of (Coni ): 

sphere, 257 
Vectors, 278, 303 
addition, 278 
angle between, 280 
cross product, 304 
dot product, 280, 304 
length (magnitude), 279, 304 
normal, 306 

orthogonal, 280, 293, 304, 309, 319 
parallel, 278 
unit, 304 
Vertex list, 174 
View plane, 128 

View plane coordinate system, 152 
View plane nonnal, 151 
View reference point, 129 
View volume, 151, 154 
canonical, 157 
parallel, 154 
perspective, 154 

Viewing coordinate system, 89, 152 
up vector, 152 

Viewing transformation, 90, 158 
normalization transformation, 90 
three-dimensional, 158 
two-dimensional, 90 
Viewport, 89 
Virtual display device, 89 
Visible polygon face, 227 

Wavelength, 229 
dominant, 230 

Weiler-Atherton polygon-clipping algorithm, 97 
Window, 89 

Window-to-viewport mapping, 90 

Wireframe, 175 

Workstation transformation, 90 

World coordinate system, 89 

Wright algorithm for mathematical surfaces, 2 1 1 

Z-buffer, 199 

Z-buffer algorithm (depth buffer algorithm), 199 
Zooming (see Animation) 



